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ADENO-ASSOCIATED VIRUS VECTORS AND USES THEREOF 
rrnss-Reference to Related Applications 

This application is a continuation-in-part, and claims priority of invention 
under 35 U.S.C § 1 19(e), from U.S. application Serial No. 60/086,166, filed May 
5 20, 1998, currently pending, the disclosure of which is incorporated by reference 
herein. 

Rnrlfgrmmd of the Invention 

Adeno-associated virus (AAV) is a non-pathogenic parvovirus with a 
single-stranded DNA genome of 4680 nucleotides. The genome may be of either 

10 plus or minus polarity, and codes for two groups of genes. Rep and Cap (Bems 
et al, 1990). Inverted tenninal repeats (ITRs), characterized by palindromic 
sequences producing a high degree of secondary structure, are present at both 
ends of the viral genome. While other members of the parvovirus group 
replicate autonomously, AAV requires co-infection with a helper virus (i.e., 

1 5 adenovirus or herpes virus) for lytic phase productive replication. In the absence 
of a helper virus, wild-type AAV (wtAAV) establishes a latent, non-productive 
infection with long-term persistence by integrating into a specific locus on 
chromosome 19, AAVSl, of the host genome through a Rep-facilitated 
mechanism (Samulski, 1993; Linden et al., 1996; Kotin et al., 1992). 

20 In conn-ast to wtAAV, the mechanism(s) of latent phase persistence of 

recombinant AAV (rAAV) is less clear. rAAV integration into the host genome 
is not site-specific due to deletion of the AAV Rep gene (Pormazhagan et al., 
1997). Analysis of integrated proviral structures of both wild type and 
recombinant AAV have demonstrated head-to-tail genomes as the predominant 

25 structural forms. 

rAAV has recently been recognized as an extremely attractive vehicle for 
gene delivery (Muzyczka, 1992). rAAV vectors have been developed by 
substituting all viral open reading frames with a therapeutic minigene, while 
retaining the cis elements contained in two inverted terminal repeats (ITRs) 

30 (Samulski et al., 1987; Samulski et al, 1989). Following transduction, rAAV 
genomes can persist as episomes (Flotte et al., 1994; Afione et al., 1996; Duan et 
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al., 1998), or alternatively can integrate randomly into the cellular genome 
(Bems et al., 1996; McLaughlin et al., 1988; Duan et al., 1997; Fisher-Adams et 
al, 1996; Keams et al., 1996; Ponnazhagan et al., 1997). However, Uttle is 
known about the mechanisms enabling rAAV vectors to persist in vivo or the 

5 identity of cellular factors which may modulate the efficiency of transduction 
and persistence. Although transduction of rAAV has been demonstrated in vitro 
in cell culture (Muzyczka, 1992) and in vivo in various organs (Kaplitt et al., 
1994; Walsh et al, 1994; Conrad et al., 1996; Herzog et al., 1997; Snyder et al., 
1997), the mechanisms of rAAV-mediated transduction remain unclear. 

1 0 Moreover, while rAAV has been shown to be capable of stable, long- 

term transgene expression both in vitro and in vivo in a variety of tissues, the 
transduction efficiency of rAAV is markedly variable in different cell types. For 
example, rAAV has been reported to transduce lung epithelial cells at low levels 
(Halbert et al., 1997; Duan et al., 1998a), while high level, persistent transgene 

1 5 expression has been demonstrated in muscle, neurons and in other non-dividing 
cells (Kessler et al., 1996; Fisher et al., 1997; Herzog et al., 1997; Xiao et al., 
1996; Kaplitt et al., 1994; Wu et al., 1998; Ali et al., 1996; Bennett et al., 1997 
Westfall et al., 1997). These tissue-specific differences in rAAV mediated gene 
transfer may, in part, be due to variable levels of cellular factors affecting AAV 

20 infectivity (i.e., receptors and co-receptors such as heparin sulfate proteoglycan, 
FGFR-1, and aVp5 integrin) (Summerford et al., 1998; Qing et al., 1999; 
Summerford et al., 1999) as well as the latent hfe cycle (i.e., nuclear trafficking 
of virus and/or the conversion of single stranded genomes to expressible forms) 
(Qing et al., 1997; Qing et al., 1998). 

25 Muscle-mediated gene transfer represents a very promising approach for 

the treatment of hereditary myopathies and several other metabolic disorders. 
Previous studies have demonstrated remarkably efficient and persistent transgene 
expression to skeletal muscle in vivo with rAAV vectors. Applications in this 
model system include the treatment of several inherited disorders such as Factor 

30 IX deficiency in hemophilia B and epo deficiencies (Kessler et al., 1996; Herzog 
et al., 1997). Although the conversion of low-molecular-weight rAAV genomes 
to high-molecular-weight concatamers has been inferred as evidence for 
integration of proviral DNA in the host genome, no direct evidence exists in this 

2 
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regard (Xiao et al., 1996; Clark et al., 1997; Fisher et al. 1997). Also, the 
molecular processes and/or structures associated with episomal long-term 
persistence of rAAV genomes, e.g., in nondividing mature myofibers, remains 
unclear. 

5 Thus, there is a need for rAAV vectors that have increased stability 

and/or persistence in host cells. Moreover, there is a need for vectors useful to 
express large open reading frames. 

Summary o f the Invention 
The present invention provides a recombinant adeno-associated virus 

1 0 (rAAV) vector comprising a nucleic acid segment formed by the juxtaposition of 
sequences in the AAV inverted terminal repeats (ITRs) which are present in a 
circular intermediate of AAV. The circular intermediate was isolated from 
rAAV-infected cells by employing a recombinant AAV "shuttle" vector. The 
shuttle vector comprises: a) a bacterial origin of replication; b) a marker gene or 

1 5 a selectable gene; c) a 5 ' ITR; and d) a 3 ' ITR. Preferably, the recombinant 

AAV shuttle vector contains a reporter gene, e.g., a GFP, alkaline phosphatase or 
(J-galactosidase gene, a selectable marker gene, e.g., an ampicillin-resistance 
gene, a bacterial origin of replication, a 5' ITR and a 3' ITR. The vector is 
contacted with eukaryotic cells so as to yield transformed eukaryotic cells. Low 

20 molecular weight DNA ("Hirt DNA") from the transformed eukaryotic cells is 
isolated. Bacterial cells are contacted with the Hirt DNA so as to yield 
transformed bacterial cells. Then bacterial cells are identified which express the 
marker or selectable gene present in the shuttle vector and which comprise at 
least a portion of a circular intermediate of adeno-associated virus. Also, as 

25 described below, it was found that circularized intermediates of rAAV impart 
episomal persistence to linked sequences in Hela cells, fibroblasts and muscle 
cells. In HeLa cells, the incorporation of certain AAV sequences, e.g., ITRs, 
from circular intermediates into a heterologous plasmid conferred a 1 0-fold 
increase in the stability of plasmid-based vectors in HeLa cells. Unique features 

30 of these transduction intermediates included the in vivo circularization of a 

head-to-tail monomer as well as multimer (concatamers) episomal viral genomes 
with associated specific base pair alterations in the 5' viral D-sequence. The 
majority of circular intermediates had a consistent head-to-tail configuration 
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consisting of monomer genomes (<3 kb) which slowly converted to large 
multimers of >12 kb by 80 days post-infection in muscle. Importantly, 
long-term transgene expression was associated with prolonged (80 day) episomal 
persistence of these circular intermediates. Thus, in vivo persistence of rAAV 

5 can occur through episomal circularized genomes which may represent 

prointegration intermediates with increased episomal stability. Moreover, as 
described below, co-infection with adenovirus, at high multiplicities of infection 
(MOI) capable of producing early adenoviral gene products, led to increases in 
the abundance and stability of AAV circular intermediates which correlated with 

1 0 an elevation in transgene expression from rAAV vectors. Thus, these results 
demonstrate the existence of a molecular structure involved in AAV transduction 
which may play a role in episomal persistence and/or integration. 

Further, these results may aid in the development of non-viral or viral- 
based gene deHvery systems having increased efficiency. For example, 

1 5 therapeutic or prophylactic therapies in which the present vectors are useful 
include blood disorders (e.g., sickle cell anemia, thalassemias, hemophilias, and 
Fanconi anemias), neurological disorders, such as Alzheimer's disease and 
Parkinson's disease, and muscle disorders involving skeletal, cardiac or smooth 
muscle. In particular, therapeutic genes useful in the vectors of the invention 

20 include the P-globin gene, the y-globin gene, the cystic fibrosis transmembrane 
conductance receptor gene (CFTR), the Fanconi anemia complementation group, 
a gene encoding a ribozyme, an antisense gene, a low density lipoprotein (LDL) 
gene, a tyrosine hydroxylase gene (Parkinson's disease), a glucocerebrosidase 
gene (Gaucher's disease), an arylsulfatase A gene (metachromatic 

25 leukodystrophies) or genes encoding other polypeptides or proteins. Also within 
the scope of the invention is the inclusion of more than one gene in a vector of 
the invention, i.e., a pluraHty of genes may be present in an individual vector. 
Further, as a circular intermediate may be a concatamer, each monomer of that 
concatamer may comprise a different gene. 

30 For viral-based delivery systems, helper-free virus can be prepared (see 

WO 95/13365) from circular intermediates or vectors of the invention. 
Alternatively, liposomes, plasmid or virosomes may be employed to deliver a 
vector of the invention to a host or host cell. 

4 
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The increased persistence of circular intennediates or vectors having one 
or a plurality of ITRs may be due to the primary and/or secondary structure of 
the ITRs. The primary structure of a consensus sequence (SEQ ID N0:3) of 
ITRs fomied by the juxtaposition and physical (phosphodiester bond) linkage of 

5 ITRs from AAV is shown in Figure 2C. However, as described hereinbelow, 
each ITR sequence may be incomplete, i.e., the ITR may be a subunit or portion 
of the full length ITRs present in the consensus sequence. Moreover, preferably, 
an isolated DNA segment of the invention is not the 165 bp double DD sequence 
(SEQ ID NO:7) disclosed in U.S. Patent No. 5,478,745, referred to as a "double 

10 sequence". 

Moreover, the formation, persistence and/or abundance of molecules 
having the ITR sequences of the invention may be modulated by helper virus, 
e.g., adenoviral proteins and/or host cell proteins. Thus, the circular 
intermediates or vectors of the invention may be useful to identify and/or isolate 

1 5 proteins that bind to the ITR sequences present in those molecules. 

Therefore, the present invention provides an isolated and purified DNA 
molecule comprising at least one DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stabihty, persistence or abundance of the 

20 isolated DNA molecule in a host cell. Preferably, the DNA molecule comprises 
at least a portion of a left (5') inverted terminal repeat (ITR) of adeno-associated 
virus. Also preferably, the DNA molecule comprises at least a portion of a right 
(3')-inverted terminal repeat of adeno-associated virus. The invention also 
provides a gene transfer vector, comprising: at least one first DNA segment, a 

25 biologically active subunit or variant thereof, of a circular intermediate of adeno- 
associaied virus, which DNA segment confers increased episomal stability or 
persistence of the vector in a host cell; and a second DNA segment comprising a 
gene. Preferably, the second DNA segment encodes a therapeutically effective 
poK-pepiide. The first DNA segment comprises ITR sequences, preferably at 

30 Icasi about 1 00, more preferably at least about 300, and even more preferably at 
least about 400, bp of adeno-associated virus sequence. A preferred vector of the 
invention is a plasmid. 

5 
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Thus, the vector of the invention is useful in a method of delivering 
and/or expressing a gene in a host cell, to prepare host cells having the vector(s), 
and in the preparation of compositions comprising such vectors. To deliver the 
gene to the host cell, a recombinant adenovirus helper virus may be employed. 
5 As described hereinbelow, the tibialis muscle of mice was co-infected 

with rAAV Alkaline phosphatase (Alkphos) and GFP encoding vectors. The 
GFP shuttle vector also encoded ampicillin resistance and a bacterial origin of 
replication to allow for bacterial rescue of circular intermediates in Hirt DNA 
from infected muscle samples. There was a time dependent increase in the 

1 0 abundance of rescued plasmids encoding both GFP and Alkphos that reached 
33% of the total circular intermediates by 120 days post-infection. Furthermore, 
these large circular concatamers were capable of expressing both GFP and 
Alkphos encoded transgenes following transient transfection in cell lines. Thus, 
concatamerization of AAV genomes in vivo occurs through intermolecular 

1 5 recombination of independent monomer circular viral genomes. Therefore, a 
plurality of DNA segments, each in an individual rAAV vector, may be 
delivered so as to result in a single DNA molecule having a plurality of the DNA 
segments. For example, one rAAV vector comprises a first DNA segment 
comprising a 5* ITR linked to a second DNA segment comprising a promoter 

20 operably linked to a third DNA segment comprising a first open reading frame 
linked to a fourth DNA segment comprising a 3* ITR. A second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment 
comprising a second open reading frame linked to a fourth DNA segment 

25 comprising a 3* ITR. 

In another embodiment, one rAAV vector comprises a first DNA 
segment comprising a 5* ITR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising the 5 ' end of an 
open reading frame linked to fourth DNA segment comprising a 5' splice site 

30 linked to a fifth DNA segment comprising a 3' ITR. The second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a 3' splice site linked to a third DNA segment comprising 
the 3 ' end of the open reading fii^e linked to a fourth DNA segment comprising 

6 

BNSDOCID; <WO 9960146A1_IA> 



wo 99/60146 



PCT/US99/ni97 



a 3* ITR. Preferably, the second and third DNA segments together comprise 
DNA encoding, for example, CTFR, factor vm, dystrophin, or erythropoietin. 
Also preferably, the second DNA segment comprises the endogenous promoter 
of the respective gene, e.g., the epo promoter. 
5 Thus, the invention provides a composition comprising: a first adeno- 

associated virus vector comprising linked DNA segments and at least a second 
adeno-associated virus comprising linked DNA segments. The linked DNA 
segments of the first vector comprise: a first DNA segment comprising a 5* 
ITR; a second DNA segment comprising at least a portion of an open reading 

10 firame operably linked to a promoter, wherein the DNA segment does not 
comprise the entire open reading frame; a third DNA segment comprising a 
splice donor site; and iv) a fourth DNA segment comprising a 3* ITR. The 
linked DNA segments of the second vector comprise a first DNA segment 
comprising a 5' ITR; a second DNA segment comprising a splice acceptor site; a 

1 5 third DNA segment comprising at least a portion of an open reading firame which 
together with the second DNA segment of the first vector encodes a fixll-length 
polypeptide; and a fourth DNA segment comprising a 3' ITR. Preferably, the 
second DNA segment of the first vector comprises a first exon of a gene 
comprising more than one exon and the third DNA segment of the second vector 

20 comprises at least one exon of a gene that is not the first exon. 

The invention also provides a method to transfer and express a 
polypeptide in a host cell. The method comprises contacting the host cell with at 
least two rAAV vectors. One rAAV vector comprises a first DNA segment 
comprising a 5 *ITR linked to a second DNA segment comprising a promoter 

25 operably linked to a third DNA segment comprising a first open reading firame 
linked to a fourth DNA segment comprising a 3* ITR. A second rAAV vector 
comprises a first DNA segment comprising a 5* ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment 
comprising a second open reading frame linked to a fourth DNA segment 

30 comprising a 3 *ITR. Alternatively, one rAAV vector comprises a first DNA 
segment comprising a 5'ITR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising the 5' end of an 
open reading fi^e linked to fourth DNA segment comprising a 5' splice site 
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linked to a fifth DNA segment comprising a 3* ITR, The second rAAV vector 
comprises a first DNA segment comprising a 5* ITR linked to a second DNA 
segment comprising a 3' sphce site linked to a third DNA segment comprising 
the 3 ' end of the open reading fi^e linked to a fourth DNA segment comprising 

5 a 3 ITR. The host cell is preferably contacted with both of the vectors, 

concurrently, although it is envisioned that the host cell may be contacted with 
each vector at a different time relative to the contact with the other vector(s). 

Also provided is a method in which the composition of the invention is 
administered to the cells or tissues of an animal. For example, rAAV vectors 

1 0 have shown promise in transferring the CFTR gene into airway epithelial cells of 
animal models and nasal sinus of CF patients. However, high level expression 
of CFTR has not been achieved due to the fact that AAV cannot accommodate 
the full-length CFTR gene together with a potent promoter. A number of studies 
have tried to optimize rAAV-mediated CFTR expression by utilizing truncated 

1 5 or partially deleted CFTR genes together with stronger promoters. However, it 
is currently unknown what effect deletions within the CFTR gene may have on 
complementation of bacterial colonization defects in the CF airway. Therefore, 
the present invention includes the administration to an animal of a composition 
of the invention comprising at least two rAAV vectors which together encode 

20 CTFR. The present invention is useful to overcome the current size limitation 

for n-ansgenes within rAAV vectors, and allows for the incorporation of a larger 

transciptional regulatory region, e.g., a stronger heterologous promoter or the 

endogenous CFTR promoter. 

Briff Tli^grriptlnn nf the Figures 

25 Figure 1 . Structure of proviral shuttle vector and the predicted structure 

of rAAV circular intemiediate monomers. With the aid of a rAAV cw-acting 
plasmid, pCisAV.GFP3ori (Panel A), AV.GFP3ori recombinant virus was 
produced (Panel B). This vector encoded a GFP transgene cassette, an 
ampicillin resistance gene (amp), and a bacterial repHcation origin (ori). The 

30 predominant form of circular intermediates isolated following transduction of 
Hela cells with AV.GFP3ori consisted of head-to-tail monomers (Panels C and 
D). 
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Figure 2. Stmctural analysis of rAAV circular interrnediates in Hela 
cells. Circular rAAV intermediate clones isolated jfrom AV.GFP3ori infected 
Hela cells were analyzed by diagnostic restriction digestion with Asel, SphI, and 
PstI together with Southern blotting against ITR, GFP, and Staffer ^^P-labeled 
5 probes. In panel A, four clones representing the diversity of intermediates found 
(pl90, p333, p280, and p345) gave a diagnostic PstI (P) restriction pattern (3kb 
and 1 .7kb bands) consistent with a circular monomer or multimer intact genome 
[agarose gel (Left) and Southern blot (Right)]. SphI (S) digestion demonstrated 
existence of a single ITR (pl90), two ITRs in a head-to-tail orientation (p333 

10 and p280), and three ITRs (p345) in isolated circular intermediates. The 

restriction pattern of pCisAV.GFP3ori (U; uncut, P; PstI cut, S; SphI cut) and 1 
kb DNA ladder (L) are also given for comparison. One additional circular form 
(p340) was repetitively seen and had an unidentifiable structure which lacked 
intact ITR sequences. Circular concatamers were identified by partial digestion 

15 with Asel for clones p280 (dimer) and p333 (monomer) as is shown in Panel B. 
Sequence analysis (Panel C) of six clones with identical restriction patterns to 
p333 (Panel A) was performed using primers (indicated by arrows) juxtaposed to 
the partial p5 promoter (dotted line) and ITRs (solid line). The top sequence 
represents the proposed head-to-tail structure of intact ITR arrays with alignment 

20 of sequence derived fi'om individual clones. The junction of the inverted ITRs is 
marked by inverted arrowheads (at 251bp). Several consistent bp changes 
(shaded) were noted in the 5 'ITR D-sequence (boxed) within four clones (p79, 
p81, p87, and p88). All bp changes are indicated in lower case letters. 

Figure 3. Adenovirus augments AAV circular intermediate formation in 

25 Hela cells. Infection of Hela cells with increasing doses (0, 500, and 5000 
particles/cell) of recombinant El-deleted adenovirus (Ad.CMVlacZ) leads to 
substantial expression of E2a 72kd DNA Binding Protein, as demonstrated by 
indirect immunofluorescent staining for DBP at 72 hours post-infection (Panel 
A). Co-infection of Hela cells with Ad.CMVlacZ (5000 particles/cell) and 

30 AV.GFP3ori (1000 DNA particles/cell) led to substantial augmentation of rAAV 
GFP transgene expression (Panel B). Augmentation in rAAV GFP transgene 
expression in the presence of increasing amounts (0, 500,* 5000 and 10000 
particles/cell) of recombinant Ad.CMVlacZ was quantified by FACS analysis at 
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72 hour post-infection (Panel C). Results demonstrate the mean (+/-SEM) for 
two experiments performed in duplicate. In addition, an aliquot of cells was split 
(1:10) at the time of FACS analysis and GFP colony forming units (CFU) per 
lOX field were quantified at 6 days (CPE denotes significant cytopathic effects 
5 at an adenoviral MOI of 1 0,000 particles/cell and was not quantified for GFP 
colonies). Hirt DNAs from AV.GFP3ori (1000 DN A particles/cell) infected 
Hela cells with or without co-infection with Ad.CMVlacZ (5,000 particles/cell) 
were used to transform £, colL The total number of ampicillin-resistant bacterial 
CFU (Panel D) and total number of head-to-tail circular intermediates CFU 

10 (Panel E) are given for a representative experiment. Greater than 20 clones for 
each time point were evaluated by Southern blot (see Figure 2 for detail). Zero 
hour controls were performed by mixing an equivalent amount of AV.GFP3ori 
virus as used in experiments with mock infected cellular lysates prior to Hirt 
purification. Panel F depicts the abundance of head-to-tail circular intermediates 

15 as a percentage of total ampicillin-resistant bacterial CFU isolated from Hirt 
DNA. 

Figure 4. Formation of r AAV head-to-tail circular intermediates 
following in vivo transduction of muscle. The tibialis anterior muscle of 4-5 
week old C57BL/6 mice were infected with AV.GFP3ori (3 X 1010 particles) in 

20 HEPES buffered saline (30 GFP expression (Panel A) was analyzed by 
direct immunofluorescence of freshly excised tissues and/or in formalin-fixed 
cryopreserved tissue sections in four independently injected muscles harvested at 
0, 5, 10, 16, 22 and 80 days post-infection. GFP expression was detected at low 
levels beginning at 10 days and was maximum at 22 days post-infection. 

25 Expression remained stable to 80 days at which time greater than 50% of the 
tissue was positive (see 80 day tissue cross section counter stained with 
propidium iodide, panel A). Hirt DNA was isolated from muscle samples at 
each of the various time points and after points was used to transform £. coli. 
Rescued plasmids (p439, pl6, pl7) were analyzed by Southern blotting in Panel 

30 B showing an agarose gel on left and ITR probed blot on right. Uiuncut, P:PstI 
cut, and S:SphI cut. The schematic drawing of the most predominant type of 
head-to-tail circular AAV intermediate plasmids rescued from bacteria is given 
in the right of Panel B and shows the structure of pl7 as an example. Other 
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typical clones included those with less than two ITRs as shown for pi 6. SphI 
digestion of pl6 and pl7 plasmids released ITR hybridizing fragments of 
approximately 140 and 300 bp, respectively. The slightly lower mobility then 
predicted for these ITR fragments likely represents anomalous migration due to 
5 the high secondary structure of inverted repeats within ITRs. Sequence analysis 
of pl7 and pl6 using nested primers to 5' and 3'-ITRs also confirmed the ITR 
orientations shown to the right of the gel. Additional restriction enzyme 
analyses to detennine this structure included double and single digests with 
SphI, PstI, Asel, and/or Smal. An example of an atypical clone (p439) rescued 
10 from bacteria with unknown structure is also shown. 

Figure 5. Frequency of circular intermediate formation in muscle 
following transduction with rAAV. Hirt DNAs isolated from rAAV infected 
tibialis muscle were used to transform £. coli and the rescued plasmids analyzed 
by Southern blotting (greater than 20 clones were analyzed from at least two 
1 5 independent muscle samples for each time point). The averages of total 

head-to-tail circular intermediate clones (line) and ampicillin resistant bacterial 
clones (bar) isolated from each tibialis anterior muscle at 0, 5, 10, 16, 22 and 80 
days post-infection are summarized in Panel A. Only plasmids which contained 
1-2 ITRs were included in the estimation of total head-to-tail circular 
20 intermediates. Plasmids which demonstrated an absence of ITR hybridizing 
SphI fragments (between 150 to 300 bp) were omitted from the calculations. 
Panel B demonstrates the diversity of ITR arrays found in head-to-tail circular 
intermediates at 80 days post-infection. This panel depicts a Southern blot 
probed with ITR sequences and represents circular intermediates with 1-3 ITRs. 
25 SphI fragments which hybridize to ITR probes indicate the size of inverted ITR 
arrays (marked by arrows to right of gel). Additional restriction enzyme analysis 
was used to determine the structure of monomer and multimer circular 
intermediates. Examples are shown for two multimer (pl36 and pl43) circular 
intermediates which contain approximately three AAV genomes. Undigested 
30 plasmids of pl36 and pl43 migrate greater than 12 kb and is contrasted to the 
most predominant form of head-to-tail undigested circular intermediates at 22 
days which migrate at 2.5 kb. The digestion pattern of pi 36 is consistent with a 
uniform head-to-tail configuration of three genomes which is indistinguishable 
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from digestion patterns of p 1 39 which contains one circularized genome 
(undigested pi 39 migrates at 2.5 kb, data not shown, also see examples pi 7 in 
Figure 4). In contrast, pl36 depicts a more complex head-to-tail multimer 
circular intermediate which has various deletions and duphcations within the 

5 ITR arrays. Predicted structure of five representative intermediates is 
schematically shown in Panel C. 

Figure 6. Molecular size of circular intermediates in muscle. Hirt DNA 
from AV.GFP3ori infected muscle was size fractionated by electrophoresis and 
various molecular weight fractions transformed into £. colL Results demonstrate 

1 0 the abundance of circular intermediates at each of the given molecular weights at 
22 and 80 days post-infection with the rAAV shuttle vector. Structure of circular 
intermediates were confirmed by Southern blot restriction analysis. 

Figure 7. Head-to-tail circular intermediates demonstrate increased 
stability of GFP expression following transient transfection in Hela cells. 

1 5 Subconfluent monolayers of Hela cells were co-transfected with p8 1 , p87, or 
pCMVGFP and pRSVlacZ as an internal control for transfection efficiency as 
described in the methods. Panel A demonstrates the expansion of GFP clones 
after one passage (arrows). Quantification ofclone size and numbers are shown 
in Panel B. Clone size represents the mean raw values while clone niunbers are 

20 normalized for transfection efficiency as determined by X-gal staining for 

pRSVlacZ. The data at the top of bar graph values for each construct in Panel B 
represents quantification of GFP clones after second passage (also normalized 
for transfection efficiency). Results indicate the mean (+/-SEM) of duphcate 
experiments with greater than 20 fields quantified for each experimental point. 

25 The persistence of transfected p8 1 and pCMVGFP plasmid DNA at passage-7 
post-transfection was evaluated by genomic Southem blot of total cellular DNA 
hybridized against ^^P-labeled GFP probe (Panel C, results from two independent 
transfections are shown). U:uncut, C:PstI cut. The migration of uncut dimer and 
monomer plasmids forms are marked on the left. PstI digestion of the plasmids 

30 results in bands at 4.7 kb (pCMVGFP, single PstI site in plasmid) and 1 .7 kb 
(p81, two PstI sites flanking the GFP gene). To determine whether the 
head-to-tail ITR array within circular intermediates was responsible for increases 
in the persistence of GFP expression, the head-to-tail ITR DNA element was 
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subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). Results in 
Panel D compare the extent of luciferase transgene expression following 
transfection with pGL3 and pGL3(ITR) at 10 days (passage-2) post-transfection. 
Results are the mean (+/-SEM) for triplicate experiments and are normalized for 
5 transfection efficiency using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 

Figure 8. Identification of adenoviral genes responsible for augmentation 
of AAV circular intermediate formation. Hela cells were infected with 
AV.GFP3ori (1000 DNA particles/cell) in the presence of wtAdS, dK02 (E2a- 

1 0 deleted), and d\ 1 004 (E4-deleted) adenovirus (at the indicated MOIs). Total 
number of head-to-tail circular intermediates from Hirt DNA and the level of 
augmentation of GPP transgene expression (as determined by FACS) was 
quantified at 24 hours post-infection (Panel A). Results are die average of 
duplicate experiments. Panel B depicts results bom Southern blot analysis of 

1 5 Hirt DNA following hybridization to a GFP P^^-labeled probe. DNA loads were 
10% of the total Hirt yield from a 35 mm plate of Hela cells. Infections were 
carried out identically to that described for Panel A. Arrows mark replication 
form concatamers (RfJ, dimers (Rfj), monomers (RfJ, and single-stranded 
AAV genomes (ssDNA). 

20 Figure 9. Model for independent mechanistic interactions of adenovirus 

with lytic and latent phase aspects of the AAV Ufe cycle. The adenoviral E4 
gene has been shown to augment the level of rAAV second strand synthesis 
giving rise to replication form dimers (RQ and monomers (R^) (Figure 8B). 
This augmentation leads to substantial increases in transgene expression from 

25 rAAV vectors and most closely mirrors lytic phase replication of wtAAV as 

head-to-head and tail-to-tail concatamers. In contrast, E4 expression inhibits the 
formation of head-to-tail circular intermediates of AAV. Hence, it q)pears that 
increases in the amount of Rf^ and R4 double stranded DNA genomes does not 
increase the extent of circular intermediate formation. Such findings suggest that 

30 conversion of R^ and Rf^ to circular intermediates does not likely occur and 
implicates two mechanistically distinct pathway for their formation. In support 
of this hypothesis, adenoviral E2a gene expression does not enhance the 
formation of R^ and Rf^ genomes but rather increase the abundance and/or 
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Stability of head-to-tail circular intermediates. Furthermore, in the absence of 
E4, E2a gene expression does not lead to augmentation of rAAV transgene 
expression. Since circular intermediates have increased episomal stability in 
muscle and in Hela cells, this molecular structure may be important in the latent 
5 phase of AAV persistence. Alternatively, these circular intermediates may 
represent pre-integration complexes as previously hypothesized for Rep 
facihtated integration. In the absence of Rep, circular intermediates may 
accumulate episomally in rAAV infected cells. In summary, these findings 
support the notion that adenovirus may modulate both latent and lytic aspects of 
10 the AAV life cycle. 

Figure 10. Individual chemical sequence of SphI fiagments from p81 (A; 
SEQ ID N0:4), p79 (B; SEQ ID N0:5), and pl202 (C; SEQ ID N0:6) AAV 
circular intermediates. The ends of the sequence (underlined) represent SphI 
restriction enzyme sites within head-to-tail circular AAV genomes cloned with 
1 5 the AV-GFP3ori shuttle virus. 

Figure 1 1 . Chemical sequence homology of three AAV circular 
intermediates with various conformations of ITR arrays. Diversity in ITR arrays 
are evident from the non-conserved bases marked in lower case. The ends of the 
sequence (underlined) represent SphI restriction enzyme sites within head-to-tail 
20 circular AAV genomes cloned with the AV.GFP3ori shuttle virus. 

Figure 12 A. Palindromic repeat structure derived from chemical 
sequencing of AAV circular intermediate isolate p81. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 
Figure 12B. Palindromic repeat structure derived from chemical 
25 sequencing of AAV circular intermediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 

Figure 12C. Palindromic repeat structure derived from chemical 
sequencing of AAV circular intemiediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 
30 Figure 13. Persistence of GFP expression m developing Xenopus 

embryos microinjected with AAV circular intermediate isolate p8 1 . The extent 
of GFP fluorescence in tadpoles reflects the stability of episomal or integrated 
microinjected plasmids. Bright field image on the left is of the p81 injected 
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embryo. The p81 injected embryo depicts fluorescence in nearly all cells by one 
week post-injection. In contrast, a mosaic pattern of expression in a minority of 
cells in pCisAV.GFPori injected embryos. The pCisAV.GFPori plasmid 
contains the identical promoter sequences driving GFP gene expression and two 
5 ITRs separated by stuffer sequence. These findings demonstrate that specific 
structural characteristics found within AAV circular intermediates are 
responsible for increased persistence of transgene expression. 

Figure 14. Mechanistic scheme for determining pathways for rAAV 
circular concatamer formation. The two independent vectors used in these 

10 studies, AV.Alkphos and AV.GFP3.ori, are shovm in Panel A . Restriction sites 
important in the structural analysis of circular intermediates are also shown. In 
Panel B, a schematic representation of two potential models for circular 
concatamer formation is depicted, along with the methods to experimentally 
differentiate which of these processes is active in muscle. Following co- 

15 infection of the tibiaUs muscle with AV.Alkphos and AV.GFP3.ori, all 
subsequently rescued plasmids arise solely fi-om circular intermediates 
containing AV.GFPSori genomes. If rolling circular replication is the sole 
mechanism of concatamerization, only GFP expressing plasmids should be 
rescued. In contrast, if intermolecular recombination between independently 

20 formed monomer circular intermediates is the mechanism of concatamerization, 
both GFP and GFP/Alkphos expressing plasmids should be rescued. 

Figure 15. Co-infection of tibialis muscle of mice with AV.Alkphos and 
AV.GFP3ori, Transgene expression of rAAV infected tibialis muscle was 
determined at 14, 35, 80 (Panels A and A'), and 120 (Panels B-D) days 

25 following co-infection with 5x10^ DNA particles each of AV.Alkphos and 
AV.GFP3ori. The time course of transgene expression started around 14 days 
and peaked by 35-80 days. The extent of co-infection of myofibers with both 
Alkphos and GFP rAAV was determined in serial sections of 80 and 120 day 
post-infection muscle samples. Panels A-C represent GFP fluorescence of 

30 formalin fixed, cryoprotected sections, while panels A'-C depict the 

histochemical staining for Alkaline phosphatase in adjacent serial sections. A 
short staining time (7 minutes) was necessary to observe variation in staining 
levels for comparison to GFP. It was found that longer staining times 
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(30 minutes) saturated the Alkphos signal. The boxed region in panels B and B ' 
are enlarged in panels C and C, respectively. A more precise correlation of GFP 
and Alkphos staining in myofibers is given in Panel D in which co-localization 
of GFP and Alkphos expression was examined in the same section of a 120 day 
5 post-infected sample. This was perfomied by photographing the GFP 

fluorescent image prior to staining for Alkphos activity. The left panel of D 
shows a high power Nomarski photomicrograph of a group of myofibers (traced 
in red) , while the corresponding GFP and Alkphos staining pattems are shown 
in the right panel. Photomicrographs of Alkphos staining were taken with a red 
1 0 filter to allow for superimposition of staining pattems with GFP fluorescence. 
Co-expression of Alkphos and GFP is shown within myofibers as a 
yellow/orange color. Myofibers are marked as follows: (-) negative for both 
Alkphos and GFP, (*) positive for only GFP, and (+) positive for both GFP and 
Alkphos. 

15 Figure 16. Rescue of circular intermediates and characterization of DNA 

hybridization patterns. Using the ampicillin resistance gene (amp) and bacterial 
ori incorporated into the AV.GFP3ori vector, the extent of circular intermediate 
formation was assessed by rescuing amp resistant plasmids following 
transformation of 1/5 the isolated Hirt DNA into £. coli Sure cells. Twenty 

20 plasmids fi-om each muscle sample were prepared and analyzed by slot blot 
hybridization against GFP, Alkphos, and Amp ^^P-labeled DNA probes. A 
representative group demonstrating the hybridization pattems is shown in Panel 
A. Panel B depicts the mean (+/-SEM) number of rescued bacterial plasmids 
that hybridized to either GPF alone, or to both GFP and Alkphos probes, 

25 following transformation of 1/5**^ of the Hirt DNA. These numbers were 

calculated fi-om the percentage of plasmids hybridizing to GPF and/or Alkphos 
and the total CFU plating efficiency derived fi-om the original transformation. In 
total, 3 independent muscle samples were analyzed for a total of 60 plasmids at 
each time point. The percentage of GFP hybridization positive rescued plasmids 

30 that also demonstrated hybridization to Alkphos is shown in Panel C. These data 
demonstrate an increase in the abundance of rescued GFP/Alkphos co-encoding 
circular intermediates over time. 
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Figure 17. Transgene expression from rescued circular intermediates. 
Rescued circular intermediate piasmids were transfected into 293 cells for 
assessment of their ability to express encoded transgenes. In these studies all 
GFP hybridization positive clones from at least two muscles were tested for each 

5 time point and scored for their abihty to express GFP and Alkaline phosphatase. 
In total at least 40 clones were evaluated for each time point. Three patterns of 
transgene expression were observed following transfection of these piasmids: I) 
no gene expression (Panel A), U) GFP expression only (Panel B), and HI) GFP 
and Alkphos expression (Panel C). Panels A-C depict Nomarski 

1 0 photomicrographs (left) of GFP fluorescent fields (center) and Alkphos staining 
of a different field from the same culture (right). The percentage of GFP 
hybridization positive clones that also expressed GFP is shown in Panel D. 
Additionally, this panel illustrates the percentage of GFP expressing clones also 
expressing Alkphos. 

15 Figure 18. Structural analysis of bi-fimctional concatamer circular 

intermediates. To fully characterize the nature of GFP and Alkphos co- 
expressing circular intermediates, detailed structural analyses were performed 
using restriction enzyme mapping and Southern blot hybridization with GFP, 
Alkphos, and ITR ^^P-labeled probes. Results from Southern blot analysis of 

20 plasmid clone*33 (Panel A) and clone #5 (Panel C) are given as representative 
examples of circular intermediates isolated from 80 and 35 day Hirt DNA of 
rAAV infected muscle, respectively. Agarose gels were run in triplicate for each 
of these clones and Southern blot filters were hybridized with one of the three 
DNA probes as indicated below each autoradiogram. Molecular weights (kb) are 

25 indicated to the left of the ethidium stained agarose gel and restriction enzymes 
are marked on the top of each gel/filter. Panels B and D give the deduced 
structure of plasmid clones #33 and #5, respectively, as based on Southem blot 
analysis. For ease of comparison with the restriction maps of the viral genomes 
given in Figure 14A, the position of restriction enzyme sites (kb) are marked 

30 u ith the indicated orientation of intact viral genomes. However, in clone #33 a 
deletion occurred between the Asel and Hindlll site of a head-to-tail array 
between AV.Alkphos and AV.GFP3ori, as reflected by a 900 bp reduction in the 
anticipated size of Hindlll/NotI and Clal/Asel fragments (marked by asterisks in 
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Panel A). Furthermore, the SphI site flanking an ITR was ablated in clone #5 
(bands effected by this deletion are marked by asterisks in Panel C). The 
deletion is not reflected in the overall concatamer since the exact region involved 
and/or the size of the deletion is imclear. Additionally, chemical sequence 

5 evidence of rescued circidar intermediates suggests that the predominant form of 
ITR arrays may be in a double-D structure (ie., one ITR flanked by two D- 
sequence rather than two ITRs) and hence ITR arrays containing fragments may 
appear 147 bp shorter than indicated. However, to more easily depict the 
orientation of viral genomes, the position of 5 'and 3' ITRs is indicated rather 

1 0 than representing a single ITR at these junctions. 

Figure 19. Application of rAAV circular concatamers to deliver trans- 
splicing vectors with large gene inserts. Panel A depicts two rAAV vectors 
encoding two halves of a cDNA (red) and flanked by splice site consensus 
sequences (brown). Panel B depicts one potential type of intermolecular 

1 5 concatamer following co-infection of cells with the independent vectors shown 
in panel A. Full length transgene mRNA can then be produced by splicing 
between these two vector encoded sequences within circular concatamers. 
Detailed DescHptinn of the Invention 

Definitinns 

20 As used herein, the terms "isolated and/or purified" refer to in vitro 

preparation, isolation and/or purification of a nucleic acid molecule of the 
invention, so that it is not associated with in vivo substances. 

As used herein, a DNA molecule, sequence or segment of the invention 
preferably is biologically active. A biologically active DNA molecule of the 

25 invention has at least about 1%, more preferably at least about 1 0%, and more 
preferably at least about 50%, of the activity of a DNA molecule comprising ITR 
sequences from a circular intermediate of AAV, e.g., a DNA molecule 
comprising SEQ ID N0:3, SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO:6, or a 
subunit or variant thereof The activity of a nucleic acid molecule of the 

30 invention can be measured by methods well known to the art, some of which are 
described hereinbelow. For example, the presence of the DNA molecule in a 
recombinant nucleic acid molecule in a host cell results in episomal persistence 
and/or increased abundance of the recombinant molecule in those cells relative to 
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corresponding cells having a recombinant nucleic acid molecule lacking a DNA 
molecule of the invention. 

A variant DNA molecule, sequence or segment of the invention has at 
least about 70%, preferably at least about 80%, and more preferably at least 

5 about 90%, but less than 1 00%, contiguous nucleotide sequence homology or 
identity to a DNA molecule comprising ITR sequences from a circular 
intermediate of AAV, e.g., SEQ ID N0:3, SEQ ID N0:4, SEQ ED N0:5, SEQ 
ID N0:6, a subunit thereof A variant DNA molecule of the invention may 
include nucleotide bases not present in SEQ ID N0:3, SEQ ID N0:4, SEQ ED 

10 N0:5, SEQ ID N0:6, e.g., 5', 3' or internal deletions or insertions, such as the 
insertion of a restriction endonuclease recognition site, so long as these bases do 
not substantially reduce the biological activity of the molecule. A substantial 
reduction in activity means a reduction in activity of greater than about 50%, 
preferably greater than about 90%. 

15 T Identiflcatinn nFNticleic AriH Mnlemites Falling Within the Scone of the 
Invention 

A Niirleir Add M nlemiles of the Invention 

1 Sniirres of the N iirleic Acid Mnlernles of the Invention 

Sources of nucleotide sequences from which the present nucleic acid 

20 molecules can be obtained include AAV infected cells, e.g., any vertebrate, 
preferably mammalian, cellular source. 

As used herein, the terms "isolated and/or purified" refer to in vitro 
isolation of a nucleic acid, e.g., DNA molecule from its natural cellular 
environment, and from association with other components of the cell, such as 

25 nucleic acid or polypeptide, so that it can be sequenced, replicated, and/or 
expressed. For example, "isolated nucleic acid'* is RNA or DNA containing 
greater than about 50, preferably about 300, and more preferably about 500 or 
more, sequential nucleotide bases that comprise a DNA segment from a circular 
intermediate of AAV which contains at least a portion of the 5' and 3 ' ITRs and 

30 the D sequence, or a variant thereof, that is complementary or hybridizes, 
respectively, to AAV ITR DNA and remains stably bound under stringent 
conditions, as defined by methods well known in the art, e.g., in Sambrook et al., 
1989. Thus, the RNA or DNA is "isolated" in that it is free bom at least one 
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contaminating nucleic acid with which it is normally associated in the natural 
source of the RNA or DNA and is preferably substantially free of any other 
mammalian RNA or DNA. The phrase "free from at least one contaminating 
source nucleic acid with which it is nomially associated" includes the case where 
5 the nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 
nonmally found in the source cell, e.g., in a vector or plasmid. An example of 
isolated nucleic acid within the scope of the invention is nucleic acid that shares 
at least about 80%, preferably at least about 90%, and more preferably at least 

10 about 95%, sequence identity with SEQ ID N0:3, SEQ ID NO:4, SEQ ID N0:5 
or SEQ ID NO:6, or a subunit thereof 

As used herein, the temi "recombinant nucleic acid" or "preselected 
nucleic acid," e.g., "recombinant DNA sequence or segment" or "preselected 
DNA sequence or segment" refers to a nucleic acid, e.g., to DNA, that has been 

1 5 derived or isolated from any appropriate cellular source, that may be 

subsequently chemically aUered in vitro^ so that its sequence is not naturally 
occurring, or corresponds to naturally occurring sequences that are not 
positioned as they would be positioned in a genome which has not been 
transformed with exogenous DNA. An example of preselected DNA "derived" 

20 from a source, would be a DNA sequence that is identified as a useful fragment 
within a given organism, and which is then chemically synthesized in essentially 
pure form. An example of such DNA "isolated" from a source would be a usefiil 
DNA sequence that is excised or removed from said source by chemical means, 
e.g., by the use of restriction endonucleases, so that it can be fiirther 

25 manipulated, e.g., airiplified, for use in the invention, by the methodology of 
genetic engineering. 

Thus, recovery or isolation of a given fragment of DNA from a restriction 
digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 

30 mobility versus that of marker DNA firagments of known molecular weight, 

removal of the gel section containing the desired fragment, and separation of the 
gel from DNA. See Lawn et al., Nuclfiir Acids Res., 2, 6103 (1981), and 
Goeddel et al., Nnrlpir Acid*; Res, . S, 4057 (1980). Therefore, ^'preselected 
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DNA" includes completely synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
sequences derived from RNA, as well as mixtures thereof. 

Nucleic acid molecules having base pair substitutions (i.e., variants) are 

5 prepared by a variety of methods known in the art. These methods include, but 
are not limited to, isolation from a natural source (in the case of naturally 
occurring sequence variants) or preparation by oligonucleotide-mediated (or site- 
directed) mutagenesis, PGR mutagenesis, and cassette mutagenesis of an earlier 
prepared variant or a non-variant version of the nucleic acid molecule. 

1 0 Ohgonucleotide-mediated mutagenesis is a preferred method for 

preparing substitution variants. This technique is well known in the art as 
described by Adelman et al., DNA, 2, 183 (1983). Briefly, AAV DNA is altered 
by hybridizing an oligonucleotide encoding the desired mutation to a DNA 
template, where the template is the single-stranded form of a plasmid or 

1 5 bacteriophage containing the unaltered or native DNA sequence of AAV. After 
hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and will code for the selected alteration in the AAV 
DNA. 

20 Generally, oligonucleotides of at least 25 nucleotides in length are used. 

An optimal oHgonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the 
single-stranded DNA template molecule. The oligonucleotides are readily 

25 synthesized using techniques known in the art such as that described by Crea et 
al., Prnr Natl. Acad Sci. U.S.A. , 25, 5765 (1978). 

The DNA template can be generated by those vectors that are either 
derived from bacteriophage M13 vectors (the commercially available M13mpl8 
and M13mpl9 vectors are suitable), or those vectors that contain a single- 

30 stranded phage origin of replication as described by Viera et al., Meth . Enzymol ,, 
iH, 3 (1987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections 4.21-4.41 of Sambrook et al,, 
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Molecular nntiing- A L ahnratnrv Manual rCold Spring Harbor Laboratory 

Press, N.Y. 1989). 

Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 
5 For alteration of the native DNA sequence (to generate amino acid 

sequence variants, for example), the oligonucleotide is hybridized to the single- 
stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 

10 a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of AAV, and the other strand (the 
original template) encodes the native, unaltered sequence of AAV. This 
heteroduplex molecule is then transformed into a suitable host cell, usually a 
prokaiyote such as E. coli JMIOI. After the cells are grown, they are plated onto 

1 5 agarose plates and screened using the oligonucleotide primer radiolabeled with 
32-phosphate to identify the bacterial colonies that contain the mutated DNA. 
The mutated region is then removed and placed in an appropriate vector, 
generally an expression vector of the type typically employed for transformation 
of an appropriate host. 

20 The method described immediately above may be modified such that a 

homoduplex molecule is created wherein both strands of the plasmid contain the 
mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is annealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 

25 deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained 
from the Amersham Corporation). This mixture is added to the template- 
oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
strand of DNA identical to the template except for the mutated bases is 

30 generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked 
with an appropriate restriction enzyme, the template strand can be digested with 
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Exoin nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
5 deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed into a suitable host cell such as E, coli JMIOI. 

For example, a preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a DNA segment comprising SEQ ID N0:3, 
SEQ ID N0:4, SEQ ID N0:5, SEQ ID N0:6, a subunit thereof or a variant 

1 0 thereof having nucleotide substitutions, or deletions or insertions. 

TT. Preparation of Mnlecules TTsefal to Practice th e Methods nf the Invention 

A. Nucleic Acid Molecules 

r. Chimerir Fypression Cassettes 

To prepare expression cassettes for transformation herein, the 

1 5 recombinant or preselected DNA sequence or segment may be circular or linear, 
double-stranded or single-stranded. Generally, the preselected DNA sequence or 
segment is in the form of chimeric DNA, such as plasmid DNA, that can also 
contain coding regions flanked by control sequences which promote the 
expression of the preselected DNA present in the resultant cell line. 

20 As used herein, "chimeric" means that a vector comprises DNA from at 

least two different species, or comprises DNA from the same species, which is 
linked or associated in a manner which does not occur in the "native" or wild 
type of the species. 

Aside from the preselected DNA sequences described above, a portion of 

25 the preselected DNA may serve a regulatory or a structural function. For 

example, the preselected DNA may itself comprise a promoter that is active in 
manmialian cells, or may utilize a promoter already present in the genome that is 
the transformation target. Such promoters include the CMV promoter, as well as 
the SV40 late promoter and retroviral LTRs (long temiinal repeat elements), 

30 although many other promoter elements well known to the art may be employed 
in the practice of the invention. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected 
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DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved expression of the DNA by affecting transcription, 
stabihty of the mRNA, or the like. Such elements may be included in the DNA 
as desired to obtain the optimal performance of the transforming DNA in the 
5 cell. 

"Control sequences" is defined to mean DNA sequences necessary for 
the expression of an operably linked coding sequence in a particular host 
organism. The control sequences that are suitable for prokaryotic cells, for 
example, include a promoter, and optionally an operator sequence, and a 

1 0 ribosome binding site. Eukaiyotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in 
a fimctional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a peptide or 

15 polypeptide if it is expressed as a preprotein that participates in the secretion of 
the peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being 

20 linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

25 The preselected DNA to be introduced into the cells fiirther will generally 

contain either a selectable marker gene or a reporter gene or both to faciUtate 
identification and selection of transformed cells from the population of cells 
sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transformation procedure. Both 

30 selectable markers and reporter genes may be flanked with appropriate 

regulatory sequences to enable expression in the host cells. Usefiil selectable 
markers are well known in the art and include, for example, antibiotic and 
herbicide-resistance genes, such as neo, hpt, dhfr, bar, aroA, dapA and the like. 
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See also, the genes listed on Table 1 of Limdquist at al. (U.S. Patent No. 
5,848,956). 

Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 
5 encode for easily assayable proteins are well known in the art. In general, a 
reporter gene is a gene which is not present in or expressed by the recipient 
organism or tissue and which encodes a protein whose expression is manifested 
by some easily detectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of £. coli, the 

10 beta-glucuronidase gene (gus) of the uidA locus of £. coli, and the luciferase 
gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at 
a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 

15 compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Clnning: A 
T ahnratnry Mflnnal , Cold Spring Harbor Laboratory Press (2d ed,, 1989), 
provides suitable methods of construction. 
2. Transformation into Host Cells 

20 The recombinant DNA can be readily introduced into the host cells, e.g., 

mammalian, bacterial, yeast or insect cells by transfection with an expression 
vector of the invention, by any procedure useful for the introduction into a 
particular cell, e.g., physical or biological methods, to yield a transformed cell 
having the recombinant DNA stably integrated into its genome or present as an 

25 episome which can persist in the transformed cells, so that the DNA molecules, 
sequences, or segments, of the present invention are maintained and/or expressed 
by the host cell. 

Physical methods to introduce a preselected DNA into a host cell include 
calcium phosphate precipitation, Hpofection, particle bombardment, 
30 microinjection, electroporation, and the like. Biological methods to introduce 
the DNA of interest into a host cell include the use of DNA and RNA viral 
vectors. The main advantage of physical methods is that they are not associated 
with pathological or oncogenic processes of viruses. However, they are less 
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precise, often resulting in multiple copy insertions, random integration, 
disruption of foreign and endogenous gene sequences, and unpredictable 
expression. 

As used herein, the term "cell line" or "host cell" is intended to refer to 
5 well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
"immortalized" in vitro by methods known in the art, as well as primary cells, or 
prokaiyotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
1 0 plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA 
sequence is related to a DNA sequence which is resident in the genome of the 
host cell but is not expressed, or not highly expressed, or, alternatively, 
overexpressed. 

"Transfected" or "transformed" is used herein to include any host cell or 
1 5 cell line, the genome of which has been altered or augmented by the presence of 
at least one preselected DNA sequence, which DNA is also referred to in the art 
of genetic engineering as "heterologous DNA," "recombinant DNA," 
"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA," 
wherein said DNA was isolated and introduced into the genome of the host cell 
20 or cell line by the process of genetic engineering. The host cells of the present 
invention are typically produced by transfection with a DNA sequence in a 
plasmid expression vector, a viral expression vector, or as an isolated linear 
DNA sequence. 

To confirm the presence of the preselected DNA sequence in the host 
25 cell, a variety of assays may be performed. Such assays include, for example, 
"molecular biological" assays well known to those of skill in the art, such as 
Southern and Northern blotting, RT-PCR and PCR; "biochemical" assays, such 
as detecting the presence of a polypeptide expressed from a gene present in the 
vector, e.g., by iimnunological means (iimnunoprecipitations, immimoafifinity 
30 columns, ELISAs and Western blots) or by any other assay useful to identify 
molecules falling within the scope of the invention. 

To detect and quantitate RNA produced from introduced DNA segments, 
RT-PCR may be employed. In this application of PCR, it is first necessary to 
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reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, 
and then through the use of conventional PCR techniques amplify the DNA. In 
most instances PCR techniques, while useful, will not demonstrate integrity of 
the RNA product. Further information about the nature of the RNA product may 
5 be obtained by Northern blotting. This technique demonstrates the presence of 
an RNA species and gives infomiation about the integrity of that RNA. The 
presence or absence of an RNA species can also be determined using dot or slot 
blot Northern hybridizations. These techniques are modifications of Northern 
blotting and only demonstrate the presence or absence of an RNA species. 

1 0 While Southern blotting and PCR may be used to detect the DNA 

segment in question, they do not provide information as to whether the DNA 
segment is being expressed. Expression may be evaluated by specifically 
identifying the polypeptide products of the introduced DNA sequences or 
evaluating the phenotypic changes brought about by the expression of the 

1 5 introduced DNA segment in the host cell. 

TTT. Dosages , Fnrmiilat innf; and Routes of Administration 

Administration of a nucleic acid molecule may be accomplished through 
the introduction of cells transformed with the nucleic acid molecule (see, for 
example, WO 93/02556), the administration of the nucleic acid molecule itself 

20 (see, for example, Feigner et al., U.S. Patent No. 5,580,859, Pardoll et al., 
Tmmnnity ^ X 165 (1995); Stevenson et al., Tmmunol. Rev,, 145, 21 1 (1995); 
Moiling, T Mol Med. . 25, 242 (1997); Donnelly et al, Ann. N.Y. Acad. Sci., 
ZZ2, 40 (1995); Yang et al., Mnl Med Today. 2, 476 (1996); Abdallah et al., 
Riol rell , Si, 1 (1995)), through infection with a recombinant virus or via 

25 liposomes. Pharmaceutical formulations, dosages and routes of administration 
for nucleic acids are generally disclosed, for example, in Feigner et al., supra. 
Administration of the therapeutic agents in accordance with the present 
invention may be continuous or intermittent, depending, for example, upon the 
recipient's physiological condition, whether the purpose of the administration is 

30 therapeutic or prophylactic, and other factors known to skilled practitioners. The 
administration of the agents of the invention may be essentially continuous over 
a preselected period of time or may be in a series of spaced doses. Both local 
and systemic administration is contemplated. When the molecules of the 
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invention are employed for prophylactic purposes, agents of the invention are 
amenable to chronic use, preferably by systemic administration. 

One or more suitable unit dosage forms comprising the therapeutic agents 
of the invention, which, as discussed below, may optionally be formulated for 
5 sustained release, can be administered by a variety of routes including oral, or 
parenteral, including by rectal, transdermal, subcutaneous, intravenous, 
intramuscular, intraperitoneal, intrathoracic, intrapulmonary and intranasal 
routes. The formulations may, where appropriate, be conveniently presented in 
discrete unit dosage forms and may be prepared by any of the methods well 

1 0 known to pharmacy. Such methods may include the step of bringing into 

association the therapeutic agent with liquid carriers, solid matrices, semi-solid 
carriers, finely divided solid carriers or combinations thereof, and then, if 
necessary, introducing or shaping the product into the desired delivery system. 
When the therapeutic agents of the invention are prepared for oral 

1 5 administration, they are preferably combined with a pharmaceutically acceptable 
carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
form. The total active ingredients in such formulations comprise from 0.1 to 
99.9% by weight of the formulation. By "pharmaceutically acceptable" it is 
meant the carrier, diluent, excipient, and/or salt must be compatible with the 

20 other ingredients of the formulation, and not deleterious to the recipient thereof 
The active ingredient for oral administration may be present as a powder or as 
granules; as a solution, a suspension or an emulsion; or in achievable base such 
as a synthetic resin for ingestion of the active ingredients from a chewing gum. 
The active ingredient may also be presented as a bolus, electuary or paste. 

25 Pharmaceutical formulations containing the therapeutic agents of the 

invention can be prepared by procedures known in the art using well known and 
readily available ingredients. For example, the agent can be formulated with 
common excipients, diluents, or carriers, and formed into tablets, capsules, 
suspensions, powders, and the like. Examples of excipients, diluents, and 

30 carriers that are suitable for such formulations include the following fillers and 
extenders such as starch, sugars, maimitol, and sihcic derivatives; binding agents 
such as carboxymethyl cellulose, HPMC and other cellulose derivatives, 
alginates, gelatin, and polyvinyl-pyrrolidone; moisturizing agents such as 
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glycerol; disintegrating agents such as calcium carbonate and sodium 
bicarbonate; agents for retarding dissolution such as parafifin; resorption 
accelerators such as quaternary ammonium compoimds; surface active agents 
such as cetyl alcohol, glycerol monostearate; adsorptive carriers such as kaolin 

5 and bentonite; and lubricants such as talc, calcium and magnesium stearate, and 
solid polyethyl glycols. 

For example, tablets or caplets containing the agents of the invention can 
include buffering agents such as calcium carbonate, magnesium oxide and 
magnesium carbonate. Caplets and tablets can also include inactive ingredients 

10 such as cellulose, pregeiatinized starch, silicon dioxide, hydroxy propyl methyl 
cellulose, magnesium stearate, microcrystalline cellulose, starch, talc, titanium 
dioxide, benzoic acid, citric acid, com starch, mineral oil, polypropylene glycol, 
sodium phosphate, and zinc stearate, and the like. Hard or soft gelatin capsules 
containing an agent of the invention can contain inactive ingredients such as 

1 5 gelatin, microcrystalline cellulose, sodium lauryl sulfate, starch, talc, and 
titanium dioxide, and the like, as well as Uquid vehicles such as polyethylene 
glycols (PEGs) and vegetable oil Moreover, enteric coated caplets or tablets of 
an agent of the invention are designed to resist disintegration in the stomach and 
dissolve in the more neutral to alkaline environment of the duodenum. 

20 The therapeutic agents of the invention can also be formulated as elixirs 

or solutions for convenient oral administration or as solutions appropriate for 
parenteral administration, for instance by intramuscular, subcutaneous or 
intravenous routes. 

The pharmaceutical formulations of the therapeutic agents of the 

25 invention can also take the form of an aqueous or anhydrous solution or 
dispersion, or altematively the form of an emulsion or suspension. 

Thus, the therapeutic agent may be formulated for parenteral 
administration (e.g., by injection, for example, bolus injection or continuous 
infusion) and may be presented in unit dose form in ampules, pre-filled syringes, 

30 small v olume infusion containers or in multi-dose containers with an added 
preser\'aiive. The active ingredients may take such forms as suspensions, 
solutions, or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Altematively, 
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the active ingredients may be in powder form, obtained by aseptic isolation of 
sterile solid or by lyophilization from solution, for constitution with a suitable 
vehicle, e.g., sterile, pyrogen-free water, before use. 

These formulations can contain phannaceutically acceptable vehicles and 
5 adjuvants which are well known in the prior art. It is possible, for example, to 
prepare solutions using one or more organic solvent(s) that is/are acceptable 
from the physiological standpoint, chosen, in addition to water, from solvents 
such as acetone, ethanol, isopropyl alcohol, glycol ethers such as the products 
sold under the name "Dowanol", polyglycols and polyethylene glycols, C1-C4 
1 0 alkyl esters of short-chain acids, preferably ethyl or isopropyl lactate, fatty acid 
triglycerides such as the products mariceted under the name "Miglyol", isopropyl 
myristate, animal, mineral and vegetable oils and polysiloxanes. 

The compositions according to the invention can also contain thickening 
agents such as cellulose and/or cellulose derivatives. They can also contain 
1 5 gums such as xanthan, guar or carbo gum or gum arabic, or alternatively 
polyethylene glycols, bentones and montmorillonites, and the like. 

It is possible to add, if necessary, an adjuvant chosen from antioxidants, 
surfactants, other preservatives, fihn-fonning, keratolytic or comedolytic agents, 
perfumes and colorings. Also, other active ingredients may be added, whether 
20 for the conditions described or some other condition. 

For example, among antioxidants, t-butylhydroquinone, butylated 
hydroxyanisole, butylated hydroxytoluene and a-tocopherol and its derivatives 
may be mentioned. The galenical forms chiefly conditioned for topical 
application take the form of creams, milks, gels, dispersion or microemulsions, 
25 lotions thickened to a greater or lesser extent, impregnated pads, ointments or 
sticks, or alternatively the form of aerosol formulations in spray or foam form or 
alternatively in the form of a cake of soap. 

Additionally, the agents are well suited to formulation as sustained 
release dosage forms and the like. The formulations can be so constituted that 
30 they release the active ingredient only or preferably in a particular part of the 
intestinal or respiratory tract, possibly over a period of time. The coatings, 
envelopes, and protective matrices may be made, for example, from polymeric 
substances, such as polylactide-glycolates, liposomes, microemulsions, 
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microparticles, nanoparticles, or waxes. These coatings, envelopes, and 
protective matrices are useful to coat indwelling devices, e.g., stents, catheters, 
peritoneal dialysis tubing, and the like. 

The therapeutic agents of the invention can be delivered via patches for 
5 transdermal administration. See U.S. Patent No. 5,560,922 for examples of 
patches suitable for transdermal delivery of a therapeutic agent. Patches for 
transdermal delivery can comprise a backing layer and a polymer matrix which 
has dispersed or dissolved therein a therapeutic agent, along with one or more 
skin permeation enhancers. The backing layer can be made of any suitable mate- 
10 rial which is impermeable to the therapeutic agent. The backing layer serves as a 
protective cover for the matrix layer and provides also a support function. The 
backing can be fonned so that it is essentially the same size layer as the polymer 
matrix or it can be of larger dimension so that it can extend beyond the side of 
the polymer matrix or overiay the side or sides of the polymer matrix and then 
1 5 can extend outwardly in a manner that the surface of the extension of the backing 
layer can be the base for an adhesive means. Alternatively, the polymer matrix 
can contain, or be foraiulated of, an adhesive polymer, such as polyacrylate or 
acrylate/vinyl acetate copolymer. For long-term applications it might be desir- 
able to use microporous and/or breathable backing laminates, so hydration or 
20 maceration of the skin can be minimized. 

Examples of materials suitable for making the backing layer are fihns of 
high and low density polyethylene, polypropylene, polyurethane, 
polyvinylchloride, polyesters such as poly(ethylene phthalate), metal foils, metal 
foil laminates of such suitable polymer fihns, and the like. Preferably, the 
25 materials used for the backing layer are laminates of such polymer fihns with a 
metal foil such as aluminum foil. In such laminates, a polymer fihn of the 
laminate will usually be in contact with the adhesive polymer matrix. 

The backing layer can be any appropriate thickness which will provide 
the desired protective and support functions. A suitable thickness will be from 
30 about 1 0 to about 200 microns. 

Generally, those polymers used to form the biologically acceptable 
adhesive polymer layer are those capable of forming shaped bodies, thin walls or 
coatings through which therapeutic agents can pass at a controlled rate. Suitable 
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polymers are biologically and pharmaceutically compatible, nonallergenic and 
insoluble in and compatible with body fluids or tissues with which the device is 
contacted. The use of soluble polymers is to be avoided since dissolution or 
erosion of the matrix by skin moisture would affect the release rate of the 
5 therapeutic agents as well as the capability of the dosage unit to remain in place 
for convenience of removal. 

Exemplary materials for fabricating the adhesive polymer layer include 
polyethylene, polypropylene, polyurethane, ethylene/propylene copolymers, 
ethylene/ethylacrylate copolymers, ethylene/vinyl acetate copolymers, silicone 

1 0 elastomers, especially the medical-grade polydimethylsiloxanes, neoprene 
rubber, polyisobutylene, polyacrylates, chlorinated polyethylene, polyvinyl 
chloride, vinyl chloride-vinyl acetate copolymer, crosslinked polymethacrylate 
polymers (hydrogel), polyvinylidene chloride, poly(ethylene terephthalate), butyl 
rubber, epichlorohydrin rubbers, ethylenvinyl alcohol copolymers, ethylene- 

1 5 vinyloxyethanol copolymers; silicone copolymers, for example, polysiloxane- 
polycarbonate copolymers, polysiloxanepolyethylene oxide copolymers, 
polysiloxane-polymethacrylate copolymers, polysiloxane-alkylene copolymers 
(e.g., polysiloxane-ethylene copolymers), polysiloxane-alkylenesilane 
copolymers (e.g., polysiloxane-ethylenesilane copolymers), and the like; 

20 cellulose polymers, for example methyl or ethyl cellulose, hydroxy propyl 

methyl cellulose, and cellulose esters; polycarbonates; polytetrafluoroethylene; 
and the like. 

Preferably, a biologically acceptable adhesive polymer matrix should be 
selecied from polymers with glass transition temperatures below room 

25 temperature. The polymer may, but need not necessarily, have a degree of 

crystallinity at room temperature. Cross-linking monomeric units or sites can be 
incorporated into such polymers. For example, cross-linking monomers can be 
incorporated into polyacrylate polymers, which provide sites for cross-linking 
the matrix after dispersing the therapeutic agent into the polymer. Known cross- 

30 linking monomers for polyacrylate polymers include polymethacrylic esters of 
polyols such as butylene diacrylate and dimethacrylate, trimethylol propane 
trimethacrylate and the like. Other monomers which provide such sites include 
allyl acrylate, allyl methacrylate, diallyl maleate and the like. 
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Preferably, a plasticizer and/or humectant is dispersed within the 
adhesive polymer matrix. Water-soluble polyols are generally suitable for this 
purpose. Incorporation of a humectant in the formulation allows the dosage unit 
to absorb moisture on the surface of skin which in turn helps to reduce skin 
5 irritation and to prevent the adhesive polymer layer of the delivery system from 
failing. 

Therapeutic agents released from a transdermal delivery system must be 
capable of penetrating each layer of skin. In order to increase the rate of 
penneation of a therapeutic agent, a transdermal drug delivery system must be 

1 0 able in particular to increase the permeability of the outermost layer of skin, the 
stratum comeum, which provides the most resistance to the penetration of 
molecules. The fabrication of patches for transdermal deUvery of therapeutic 
agents is well known to the art. 

For administration to the upper (nasal) or lower respiratory tract by 

1 5 inhalation, the therapeutic agents of ttie invention are conveniently delivered 
from an insufflator, nebulizer or a pressurized pack or other convenient means of 
delivering an aerosol spray. Pressurized packs may comprise a suitable 
propellant such as dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 

20 pressurized aerosol, the dosage unit may be determined by providing a valve to 
deliver a metered amount. 

Altematively, for administration by inhalation or insufflation, the 
composition may take the form of a dry powder, for example, a powder mix of 
the therapeutic agent and a suitable powder base such as lactose or starch. The 

25 powder composition may be presented in unit dosage form in, for example, 
capsules or cartridges, or, e.g., gelatine or bUster packs from which the powder 
may be administered with the aid of an inhalator, insufflator or a metered-dose 
inhaler. 

For intra-nasal administration, the therapeutic agent may be administered 
30 via nose drops, a liquid spray, such as via a plastic bottle atomizer or metered- 
dose inhaler. Typical of atomizers are the Mistometer (Wintrop) and the 
Medihaler (Riker). 
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The local delivery of the therapeutic agents of the invention can also be 
by a variety of techniques which administer the agent at or near the site of 
disease. Examples of site-specific or targeted local delivery techniques are not 
intended to be limiting but to be illustrative of the techniques available. 

5 Examples include local delivery catheters, such as an infusion or indwelling 
catheter, e.g., a needle infusion catheter, shunts and stents or other implantable 
devices, site specific carriers, direct injection, or direct applications. 

For topical administration, the therapeutic agents may be formulated as is 
known in the art for direct application to a target area. Conventional forms for 

1 0 this purpose include wound dressings, coated bandages or other polymer 
coverings, ointments, creams, lotions, pastes, jellies, sprays, and aerosols. 
Ointments and creams may, for example, be formulated with an aqueous or oily 
base with the addition of suitable thickening and/or gelling agents. Lotions may 
be formulated with an aqueous or oily base and will in general also contain one 

1 5 or more emulsifying agents, stabilizing agents, dispersing agents, suspending 
agents, thickening agents, or coloring agents. The active ingredients can also be 
delivered via iontophoresis, e.g., as disclosed in U.S. Patent Nos. 4,140,122; 
4,383,529; or 4,051,842. The percent by weight of a therapeutic agent of the 
invention present in a topical formulation will depend on various factors, but 

20 generally will be fi-om 0.01 % to 95% of the total weight -of the formulation, and 
typically 0.1-25% by weight. 

Drops, such as eye drops or nose drops, may be formulated with an 
aqueous or non-aqueous base also comprising one or more dispersing agents, 
solubilizing agents or suspending agents. Liquid sprays are conveniently 

25 delivered fi-om pressurized packs. Drops can be delivered via a simple eye 
dropper-capped bottle, or via a plastic bottle adapted to deliver liquid contents 
dropwise, via a specially shaped closure. 

The therapeutic agent may further be formulated for topical 
administration in the mouth or throat. For example, the active ingredients may 

30 be formulated as a lozenge further comprising a flavored base, usually sucrose 
and acacia or tragacanth; pastilles comprising the composition in an inert base 
such as gelatin and glycerin or sucrose and acacia; and mouthwashes comprising 
the composition of the present invention in a suitable liquid carrier. 
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The formulations and compositions described herein may also contain 
other ingredients such as antimicrobial agents, or preservatives. Furthermore, 
the active ingredients may also be used in combination with other therapeutic 
agents, for example, bronchodilators. 

5 In particular, for delivery of a vector of the invention to a tissue such as 

muscle, any physical or biological method that will introduce the vector into the 
muscle tissue of a host animal can be employed. Vector means both a bare 
recombinant vector and vector DNA packaged into viral coat proteins, as is well 
known for AAV administration. Simply dissolving an AAV vector in phosphate 

10 buffered saline has been demonstrated to be sufficient to provide a vehicle useful 
for muscle tissue expression, and there are no known restrictions on the carriers 
or other components that can be coadministered with the vector (although 
compositions that degrade DNA should be avoided in the normal manner with 
vectors). Pharmaceutical compositions can be prepared as injectable 

1 5 formulations or as topical formulations to be deUvered to the muscles by 

transdermal transport. Numerous formulations for both intramuscular injection 
and transdermal transport have been previously developed and can be used in the 
practice of the invention. The vectors can be used with any pharmaceutically 
acceptable carrier for ease of administration and handling. 

20 For purposes of intramuscular injection, solutions in an adjuvant such as 

sesame or peanut oil or in aqueous propylene glycol can be employed, as well as 
sterile aqueous solutions. Such aqueous solutions can be buffered, if desired, 
and the liquid diluent first rendered isotonic with saline or glucose. Solutions of 
the AAV vector as a free acid (DNA contains acidic phosphate groups) or a 

25 pharmacologically acceptable salt can be prepared in water suitably mixed with a 
surfactant such as hydroxypropylcellulose. A dispersion of AAV viral particles 
can also be prepared in glycerol, liquid polyethylene glycols and mixtures 
thereof and in oils. Under ordinary conditions of storage and use, these 
preparations contain a preservative to prevent the growth of microorganisms. In 

30 this connection, the sterile aqueous media employed are all readily obtainable by 
standard techniques well-known to those skilled in the art. 

The pharmaceutical forms suitable for injectable use include sterile 
aqueous solutions or dispersions and sterile powders for the extemporaneous 
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prq)aration of sterile injectable solutions or dispersions. In all cases the form 
must be sterile and must be fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria 

5 and fimgi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, liquid 
polyethylene glycol and the like), suitable mixtures thereof, and vegetable oils. 
The proper fluidity can be maintained, for example, by the use of a coating such 
as lecithin, by the maintenance of the required particle size in the case of a 

1 0 dispersion and by the use of surfactants. The prevention of the action of 
microorganisms can be brought about by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal and 
the like. In many cases it will be preferable to include isotonic agents, for 
example, sugars or sodium chloride. Prolonged absorption of the injectable 

1 5 compositions can be brought about by use of agents delaying absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the AAV vector 
in the required amount in the appropriate solvent with various of the other 
ingredients enumerated above, as required, followed by filtered sterilization. 

20 Generally, dispersions are prepared by incorporating the sterilized active 

ingredient into a sterile vehicle which contains the basic dispersion medium and 
the required other ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable solutions, the preferred 
methods of preparation are vacuum drying and the freeze drying technique which 

25 yield a powder of the active ingredient plus any additional desired ingredient 
from the previously sterile-filtered solution thereof. 

For purposes of topical administration, dilute sterile, aqueous solutions 
(usually in about 0.1% to 5% concentration), otherwise similar to the above 
parenteral solutions, are prepared in containers suitable for incorporation into a 

30 transdermal patch, and can include known carriers, such as pharmaceutical grade 
dimethylsulfoxide (DMSO). 

The therapeutic compounds of this invention may be administered to a 
mammal alone or in combination with pharmaceutically acceptable carriers. As 
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noted above, the relative proportions of active ingredient and carrier are 
determined by the solubility and chemical nature of the compound, chosen route 
of administration and standard pharmaceutical practice. 

The dosage of the present therapeutic agents which will be most suitable 
5 for prophylaxis or treatment will vary with the form of administration, the 
particular compound chosen and the physiological characteristics of the 
particular patient under treatment. Generally, small dosages will be used 
initially and, if necessary, will be increased by small increments until the 
optimum effect under the circumstances is reached. Exemplary dosages are set 
1 0 out in the example below. 

Since AAV has been shown to have a broad host range (for pulmonary 
expression) and persists in muscle, the vectors of the invention may be employed 
to express a gene in any animal, and particularly in mammals, birds, fish, and 
reptiles, especially domesticated mammals and birds such as cattle, sheep, pigs, 
15 horses, dogs, cats, chickens, and turkeys. Both human and veterinary uses are 
particularly preferred. 

The gene being expressed can be either a DNA segment encoding a 
protein, with whatever control elements (e.g., promoters, operators) are desired 
by the user, or a non-coding DNA segment, the transcription of which produces 
20 all or part of some RNA-containing molecule (such as a transcription control 
element, +RNA, or anti-sense molecule). 

Muscle tissue is a very attractive target for in vivo gene delivery and gene 
therapy, because it is not a vital organ and is very easy to access. If a disease is 
caused by a defective gene product which is required to be produced and/or 
25 secreted, such as hemophilia, diabetes and Gaucher's disease, and the like, is 

muscle is a good candidate to supply the gene product if the appropriate gene can 
be effectively delivered into the cells. 

Different vectors, such as naked DNA, adenovirus and retrovirus, have 
been utilized to directly deliver various transgenes into muscle tissues. 
30 However, neither system can offer both high efficiency and long-term 

expression. For naked plasmid DNA directly delivered into muscle tissue, the 
efficiency is not high. There are only a few cells near the injection site that can 
maintain transgene expression. Furthermore, the plasmid DNA in the cells 
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remaiiis as non-replicating episomes, i.e., in the unintegrated form. Therefore, it 
will be eventually lost. For adenovirus vector, it can infect the non-dividing 
cells, and therefore, can be directly delivered into the mature tissues such as 
muscle. However, the transgene delivered by adenovirus vectors are not useful 

5 to maintain long-term expression for the following reasons. First, since 
adenovirus vectors still retain most of the viral genes, they are not very safe. 
Moreover, the expression of those genes can cause the immune system to destroy 
the cells containing the vectors (see, for example, Yang et al. 1994, Proc. Natl. 
Acad. Sci. 91 :4407-441 1). Second, since adenovirus is not an integration virus, 

10 its DNA will eventually be diluted or degraded in the cells. Third, due to the 
immune response, adenovims vector could not be repeatedly delivered. In the 
case of lifetime diseases, this will be a major Umitation. For retrovirus vectors, 
although they can achieve stable integration into the host chromosomes, their use 
is very restricted because they can only infect dividing cells while a large 

1 5 majority of the muscle cells are non-dividing. 

Adeno-associated virus vectors have certain advantages over the above- 
mentioned vector systems. First, like adenovirus, AAV can efficiently infect 
non-dividing cells. Second, all the AAV viral genes are eliminated in the vector. 
Since the viral-gene-expression-induced immune reaction is no longer a concern, 

20 AAV vectors are safer than Ad vectors. Thirds, AAV is an integration virus by 
nature, and integration into the host chromosome will stably maintain its 
transgene in the cells. Fourth, AAV is an extremely stable virus, which is 
resistant to many detergents, pH changes and heat (stable at 56°C for more than 
an hour). It can be lyophilized and redissolved without losing its activity. 

25 Therefore, it is a very promising delivery vehicle for gene therapy. 

The invention will be further described by, but is not limited to, the 
following examples. 

Kxampic 1 

Matftrialf; and Methods 
30 Constnictinn n f rA AV Shuttle Vfictor. 

A recombinant AAV shuttle vector (AV.GFP3ori) which contained a 
GFP transgene cassette, bacterial ampicillin resistance gene, and bacterial origin 
of replication, was generated from a cw-acting plasmid (pCisAV.GFPSori). 
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Expression of the GFP gene was directed by the CMV promoter/enhancer and 
SV40 poly-adenylation sequences. pCisAV.GFP3ori was constructed with 
pSub201 derived ITR elements (Samulski et ah, 1987) and the intactness of ITR 
sequences was confirmed by restriction analysis with Smal and PvuII, and by 

5 sequencing. Recombinant AAV stocks were generated by co-transfection of 
pCisAV.GFP3ori and pRep/Cap together with co-infection of recombinant 
Ad.CMVlacZ in 293 cells (Duan et al, 1997). Following transfection of forty 
150 mm plates, cells were collected at 72 hours by centrifugation and 
resuspended in 12 ml of buffer (10 mM Tris pH 8.0). Vims was released from 

1 0 cells by three cycles of freeze/thawing and passaged through a 25 gauge needle 
six times. Cell lysates were then treated with 13 mg/ml DNase I at 37°C for 30 
minutes and 1% deoxycholate (g/ml final) and 0.05% trypsin (g/ml fmal) at 
37°C for 30 minutes. Samples were then placed on ice for 10 minutes and 
centrifiiged to remove large particulate material at 3,000 rpm for 30 minutes. 

15 rAAV was purified by isopycnic density gradient centrifiigation in CsCl 

(r=l .4) in a SW55 rotor for 72 hours at 35K. Peak fractions of AAV were 
combined and re-purified through two more rounds of CsCI centrifiigation, 
followed by heating at 58 ""C for 60 minutes to inactivate all contaminant helper 
adenovirus. Typically, this preparation gave approximate AAV titers of 10^^ 

20 DNA molecules/ml and 2.5 x 10^ GFP-expressing units/ml. Recombinant viral 
titers were assessed by slot blot and quantified against pCisAV.GFP3ori controls 
for DNA particles. Functional transducing units were quantified by GFP 
transgene expression in 293 cells. The absence of helper adenovirus was 
confirmed by histochemical staining of rAAV infected 293 cells for 

25 beta-galactosidase, and no recombinant adenovirus was found in 10*** particles of 
purified rAAV stocks. The absence of significant wtAAV contamination was 
confirmed by immunocytochemical staining of rAAV/ Ad co-infected 293 cells 
with anti-Rep antibodies. These studies, which had a sensitivity of 1 wtAAV in 
10*** rAAV particles, demonstrated an absence of Rep staining as compared to 

30 pRep/Cap plasmid transfected controls. 
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T«;nlatinn and Structural Fvaluatinn nf AAV rirnilar Tntermediates From Hela 

Hela cells were grown in 35 mm dishes in DMEM media supplemented 
with 10% fetal calf serum (PCS). Cells were infected in the presence of 2% PCS 
5 at 80% confluency with recombinant AV.GFP3ori (MOI=1000 particles/cell, 1 x 
1 0^ total particles/plate) and Hirt DNAs isolated as described by Duan et al. 
(1997) at 6, 12, 24, 48, and 72 hours post-infection. In experiments analyzing 
the effects of adenovirus, plates were co-infected with Ad.CMVLacZ 
(MOI=5000 particles/cell) in the presence of 2%FCS/DMEM. Zero hour 

1 0 controls were generated by mixing 1 0^ particles of AV.GFP3ori with cell lysates 
prior to Hirt DNA preparation. Hirt DNA isolated at each time point was used to 
transform £. coli SURE cells (Stratagene, La Jolla, CA.). Typically, 1/10 of the 
Hirt DNA preparation was used to transform 40 ml of competent bacteria by 
electroporation. The resultant total number of bacterial colonies was quantified 

15 for each time point and the structure of circular intermediates was evaluated for 
greater than 20 plasmid clones for each time point from two independent 
experiments. Structural determinations were based on restriction enzyme 
analysis using PstI, SphI, Asel single and double digests together with Southem 
blotting against GPP, stuffer, and ITR probes. 

20 Fvahiatinn nf F9fl anH GFP pene expression in Hela cells. 

E2a gene expression was evaluated by immunofluorescent staining of 
Hela cells superinfected with El-deleted AdCMVlacZ (MOI= 0, 500, 5000 
particles/cell). Briefly, cells were fixed in methanol at -20**C for 10 minutes 
followed by air drying. Cells were then incubated at room temperature with 

25 hybridoma supernatant against Ad5 72kd DBP (Reich et al., 1983), followed by 
goat anti-mouse-FITC antibody (5 mg/ml) for 30 minutes at room temperature. 
In studies evaluating augmentation of AAV GPP transgene expression by 
adenovirus, Hela cells were harvested at 24 or 72 hours post-infection by 
trypsinization, resuspended in 2%PCS/PBS and evaluated by FACS analyses. 

30 Thresholds were set using uninfected controls and the percentage and/or the 
average relative fluorescent intensity was determined by sorting greater than 10^ 
cells per experiment condition. 
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Reqnmce Analysis of AAV Circular Intermediates. 

Sequence analysis of the ITR array within circular intermediates was 
performed using primers ELI 18 (5'-CGGGGGTCGTTGGGCGGTCA-3'; SEQ 
ID NO:l) and EL230 (5'-GGGCGGAGCCTATGGAAAA-3'; SEQ ID N0:2) 
5 which are nested to 5' and 3' ITR sequences, respectively. Both circular and 
linearized (with Smal which cuts within ITR sequences) plasmids were 
sequenced. 
Results 

Construction of rA AV S^hiittle Vector and Isolation of Circular Tntermediates. 

1 0 To circumvent the inability to retrieve pre-integration intermediates or as 

stable episomal forms resistant to nuclease digestion, an alternative strategy was 
developed to "trap" circular intermediates using a recombinant AAV shuttle 
vector. Recombinant AV.GFP3ori virus (Figure IB) was generated from a 
cw-acting plasmid (pCisAV.GFP3ori, Figure 1 A) by co-transfection in 293 cells 

15 with trans-acting plasmids encoding Rep and Cap viral genes. This viral vector 
(AV.GFP3ori) encoded the green fluorescent protein (GFP) reporter gene, a 
bacterial origin of replication (ori), and the bacterial ampicillin-resistance gene. 
Ori and ampicillin-resistance sequences encoded in this virus allow for the 
rescue of circular AAV genomes formed during the transduction process. 

20 - To test this strategy, Hela cells were infected with AV.GFP3ori 

(MOI=1000 particles/cell) and the abundance of circular intermediates was 
evaluated following transformation of low molecular weight cellular Hirt DNA 
into E. coll SURE cells. The presence of circular intermediates was inferred by 
retrievable ampicillin-resistant bacterial colonies. Structural features of circular 

25 intermediates were determined by restriction enzyme analysis and Southern 
blottmg with various regions of the provirus, including GFP, Stuffer, and ITR 
sequences. 

The predominant circular form isolated after transduction of Hela cells 
wiih AV.GFP3ori consisted of 4.7 kb monomer-sized molecules (Figure IC). 
30 Sphi digestions of these circular intermediates yielded characteristic 300 bp 
bands which hybridized to an ITR probe on Southern blots (Figure 2A). PstI, 
SphI, Asel single and double digests together with Southern blot analysis using 
GFP, Stuffer (data not shown), and ITR (Figure 2 A) probes confirmed the 
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structure of the circular intermediates as head-to-tail monomer genomes (Figure 
IC). In particular, PstI digests together with ITR Southern blots distinguish 
these head-to-tail circular intermediates from head-to-head or tail-to-tail circular 
dimers. Similar results obtained with studies on AV.GFP3ori infected 293 cells 
5 and primary fibroblasts have confirmed that monomer head-to-tail circular 
intermediates were also the most abundant form in these cell types. 

Because the predicted molecular weight of an intact head-to-tail ITR 
SphI fiagment would be approximately 360 bp, an anomalous migration in 
agarose gels might be due to the high secondary structure of inverted repeats 

1 0 within ITRs. To this end, the head-to-tail orientation of the ITRs, as predicted 
by Southern blot analysis, was confirmed using several sequencing strategies. 
First, the SphI ITR hybridizing fragment of a circular intermediates was 
subcloned into a secondary plasmid vector and sequenced with primers outside 
the ITR cloned sequences. These findings confirmed the head-to-tail orientation 

1 5 of ITRs. Additionally, sequence was obtained directly from six monomer 
circular intermediate clones using primers internal to both the 5' and 3' ITRs 
(Figure 2C). In these studies, circular intermediates were digested with Smal 
and the linear 4.6 kb plasmid was gel isolated prior to sequencing. Smal 
digestion (which relaxed the secondary structure of ITRs) was necessary to 

20 obtain sequence information within the ITRs. The sequencing results presented 
in Figure 2C confirmed the orientation of head-to-tail ITR arrays in these 
intermediates. 

Interestingly, sequencing also revealed several consistent base pair (bp) 
changes in four ofthe six clones analyzed (Figure 2C). These four clones (p79, 

25 p81, p87, and p88) had consistent two bp changes within the D-sequence [G->A 
(122bp) and A->G (125bp)], which always occurred together with the bp 
alterations m the p5 promoter [A->G (1 14bp) and A->C (1 15bp)]. No other 
consistent bp changes were noted except for two clones (p79 and p88) which 
demonstrated mutations just outside the 3 'ITR D-sequence [T->G (381bp) and 

30 T->C (383bp)]. 

Although head-to-tail circular intermediates were the most abundant 
forms present in Hirt DNA from rAAV infected Hela cells, several less frequent 
structures were also detected. These included monomer circularized AAV 
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genomes with one (pi 90) and three ITRs (p345) arranged in a head-to-tail 
fashion as well as several clones with an unknown structure lacking complete 
ITRs (p340) (Figure 2A). Such diversity within the ITR array may represent 
homologous recombination in vivo or in bacteria during amplification. However, 

5 previous studies demonstrating similar variations in ITR sequences of 

head- to-tail integrated genomes, suggest that such changes in the length of the 
ITR array may occur in vivo (Duan et al., 1997) Additionally, less frequent 
head-to-tail circularized multimer forms were predicted based on the variation in 
migration patterns of uncut plasmids which gave identical restriction patterns. 

1 0 Results shown in Figure 2B confimied the existence of monomer and dimer 
head-to-tail circular intermediates using partial digestion with an enzyme which 
cuts once in the AAV genome (Asel). Cumulative analysis of greater than 200 
independently isolated circular interaiediates from Hela cells demonstrated that 
head-to-tail circular AAV genomes occurred in greatest abundance as monomers 

1 5 (92%) and less frequently as multimers of greater than one genome (8%). 

To establish that head-to-tail circular intermediates were formed in vivo 
and not by non-specific bacterial recombination of linear AAV genomes present 
in the Hirt DNA, a set of reconstitution experiments was performed by which the 
same number of r AAV particles used for infection experiments were spiked into 

20 Hela cell lysates prior to Hirt preparations. In these studies, background 
bacterial amplification of Hirt DNA spiked with linear rAAV genomes was 
negligible (Figure 3D) and of the few isolated colonies obtained from these 
controls, none had a predicted head-to-tail structure as assessed by Southem blot 
restriction enzyme analysis (Figure 3E). Additionally, reconstitution 

25 experiments transforming bacterial with linearized dsDNA AAV genomes did 
not give rise to significant levels of replication competent plasmids or the 
characteristic head-to-tail structure associated with AAV circular intermediates. 
These findings confirm that circular intermediates do not likely arise from 
non-specific recombination or ligation events with either ssDNA or dsDNA 

30 linear AAV genomes in bacteria. Additional control experiments, demonstrating 
the lack of stuffer hybridizing sequences in AAV circular intermediates by 
Southem blotting, also confirm that these structures do not arise from 
contamination of viral stocks with pCisAV.GFP3ori plasmid. 
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Jhp fArmatinn nf hftflH-to-tail c irrnlflr AAV intftrmediatef; is augmented bv 
f^l^ppWnfcrtinn with Fl -deleted adennvirus. 

Many aspects of the wtAAV growth cycle are affected by helper 
adenovirus, including AAV DNA replication, transcription, sphcing, translation, 
5 and virion assembly. Such studies have provided concrete evidence that a subset 
of Ad early gene products provide helper functions for the wtAAV lytic cycle, 
including: Ela, Elb, E2a, E4 ORF6 and VAl RNA (Muzyczka, 1992). In this 
regard, one of the most critical factors v^hich is required for AAV rephcation is 
the 34 kD E4 protein (0RF6). Recent observations on the helper fimction of Ad 

1 0 in rAAV transduction have also demonstrated that Ad E4 0RF6 is essential for 
the augmentation of rAAV transgene expression seen with adenovirus 
co-infection (Ferrari et al., 1996; Fisher et al., 1996). According to these reports, 
the rate-limiting step enhanced by these adenoviral proteins is the conversion of 
single stranded AAV genomes to double stranded forms. 

1 5 Studies evaluating the kinetics of rAAV circular intermediate formation 

demonstrated a time-dependent increase in abundance which peaked at 24 hours 
post-infection in Hela cells and coincided with the onset of GFP transgene 
expression (Figure 3). To better understand the cellular mechanisms associated 
with AAV circular intermediate formation, the effects of adenoviral co-infection 

20 on this process were evaluated. The extent of transgene expression and circular 
intermediate formation in AV.GFP3ori infected Hela cells with or without 
co-infection with El -deleted recombinant adenovirus was compared. 

Although El -deleted adenoviruses are severely handicapped in their 
ability to synthesize viral gene products, at high MOIs of >5000 significant E2a 

25 protein expression was noted (Figure 3 A). As an indicator of transgene 

expression, the abundance and average relative intensity of GFP positive cells 
was determined against mock infected controls by fluorescent microscopy 
(Figure 3B) and FACS analysis (Figure 3C) at 72 hours post-infection. In accord 
with previous reports demonstrating augmentation in rAAV transgene expression 

30 by adenovirus (Ferrari et al., 1996; Fisher et al„ 1996), the extent of GFP 

transgene expression was dramatically increased at doses of adenovirus which 
led to viral gene expression (MOI>5000; Figures 3A-C). Additionally, 
persistence of rAAV transgene expression was also augmented by co-infection 
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with El -deleted adenovirus, as determined by GFP-expressing colony formation 
following serial passages (Figure 3C). 

If circular intermediates represent a molecular form of rAAV important 
for efficient and/or persistent transgene expression, augmentation of rAAV 
5 transgene expression by adenovirus might also modulate circular intermediate 
formation. In these studies, the abundance and time course of AAV circular 
intermediate formation was evaluated following superinfection with 
Ad.CMVLacZ. Resuhs from these experiments are shown in Figure 3D, which 
represents the total number of bacterial colonies (per 35 mm plate) obtained 

10 following transformation of £. coli with Hirt DNA isolated from Hela cells 
infected with AV.GFP3ori (1000 DNA particles/cell) with or without 
co-infection with AdCMVlacZ (5,000 particles/cell). An MOI of 5000 Ad 
particles/cell was chosen for these experiments since this level of adenovirus led 
to minimal cytopathic effect (CPE) with high levels of E2a expression. 

1 5 These studies demonstrated a nearly 2- fold augmentation by 

Ad.CMVLacZ in the total abundance of AAV rescued plasmid intermediates in 
E. coli (Figure 3D). Southern blot restriction enzyme analysis demonstrated that 
the predominant forms in both the presence and absence of adenovirus were 
head-to-tail monomer circular intermediates containing the diagnostic 300 bp 

20 ITR fragment following SphI digestion (Figure 3E). Additionally, results 
demonstrated that adenovirus co-infection led to an earlier time of onset and 
increased stabiUty of AAV head-to-tail monomer circular intermediates (Figures 
3E and F). For example, at 6 hours post-infection, head-to-tail circular 
intermediates were only present in Hela cells co-infected with adenovirus. 

25 Furthermore, a decline in the percentage of head-to-tail circular intermediate 
clones was seen at 48-72 hours post- AAV infection in the absence of adenovirus. 
In contrast, this decline was significantly blunted by the presence of helper 
adenovirus (Figure 3F). Based on these findings, it was concluded that certain 
adenoviral proteins produced by superinfection with El -deleted adenovirus were 

30 capable of modulating circular intermediates fonmation and stability during 
rAAV transduction. 
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Discussion 

In the present study, it was shown that circularization of linear AAV 
genomes occurs during rAAV transduction. Circularization appears to 
predominately occur as head-to-tail monomer genomes. However, the existence 
5 of less abundant circular multimer forms suggests that recombinational events 
subsequent to the initial infection may drive concatamerization of circular 
genomes. The diversity in the length of ITR arrays found within circular 
intermediates (i.e., 1-3 ITRs) also supports the notion that these forms may be 
highly recombinagenic. Of mechanistic interest in the formation of circular 

10 intemiediates is the uniformity of mutations observed in the D-sequences and 
nearby p5 promoter region and the confinement of these mutations to the 
5 '-ITRs. Although the etiology of these base pair changes is unknown, their 
uniformity suggests that they may have a direct role in the formation of circular 
intermediates and in increased stability. Recent findings, which suggest that an 

1 5 endogenous host single strand D-sequence binding protein is important in rAAV 
transduction, lend support to the potential involvement of this sequence in 
circular intermediate formation (Wang et al., 1997; Qing et al., 1998). 
Furthermore, it remains to be determined whether the in vivo formation of AAV 
circular intermediates occurs through the circularization of single or double 

20 stranded AAV genomes. 

By analogy, retroviral transduction intermediates have striking 
similarities to the current findings with AAV. Three DNA forms have been 
isolated following retroviral infection, including linear DNA with long terminal 
repeats (LTRs) at both ends, circular DNA with one LTR, and circular DNA 

25 with multiple LTRs (Panganiban, 1985). Although it is disputed which of these 
forms are the direct precursor to integration, the existence of circular retroviral 
genomes which also have similar repeat regions at the ends of their genomes 
suggests the potential for common mechanisms with the formation of AAV 
circular intermediates. These AAV circular intermediates could act as 

30 integration precursors and/or stable episomal genomes. 

The head-to-tail ITR structures found in AAV circular intermediates are 
most characteristic of latent integrated AAV genomes. In contrast, lytic phases 
of AAV growth are typically associated with head-to-head and tail-to-tail 
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replication form genomes. Hence, it is likely that circular intermediates represent 
a latent aspect of the AAV life cycle. The finding that co-infection with 
adenovirus leads to increased abundance and stability of AAV circular 
intermediates suggests a novel link between adenoviral helper functions and 
5 latent infection of AAV, 

Aspects of inverted head-to-tail ITRs, which include palindromic 
hairpins similar in structure to "HoUiday-like" junctions, might impart 
recombinagenic activity which aids in viral integration. Such HoUiday junctions 
have been shown to play critical roles in directing homologous recombination in 

1 0 bacteria through the processing of recombination intermediates by RuvABC 
proteins (West, 1997; Lee et al., 1998). Interestingly, a mammalian 
endonuclease, analogous to bacterial RuvC resolvase, has also been isolated 
from cell lines (Hyde et al., 1994). Despite the theoretical considerations which 
might suggest that circular AAV genomes have characteristics of preintegration 

1 5 intermediates, a study with recombinant retrovirus has demonstrated that 
palindromic LTR-LTR junctions of MMLV are not efficient substrates for 
proviral integration (Lobel et al., 1989). Nonetiieless, circular AAV genomes 
have been previously proposed as integration intermediates based on proviral 
structure (Linden et al, 1996). 

20 Example 2 

Methods 

Prnrinntinn of rA AV .Shuttle Vector. 

The m-acting plasmid (pCisAV.GFP3ori) used for rAAV production 
was generated by subcloning the Bsp 1201 /Not I fragment (743 bp) of the GFP 

25 transgene from pEGFP-1 (Clontech) between the CMV enhancer/promoter and 
SV40polyA by blunt-end hgation. A 2.5 kb cassette containing beta-lactamase 
and bacterial replication origin from pUC19 was blunt ligated down-stream of 
GFP reporter cassette. The ITR elements were derived from pSub201 .2 The 
entire plasmid contains a 4.7 kb AAV component flanked by a 2 kb stuffer 

30 sequence. The integrity of ITR sequences was confirmed by restriction analysis 
with Smal and PvuII, and by direct sequencing using a modified di-deoxy 
procedure which allowed for complete sequence through both 5' and 3' ITRs. 
Recombinant AAV stocks were generated by co-transfection of 
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pCisAV.GFPSori and pRep/Cap together with co-infection of recombinant 
Ad.CMVlacZ in 293 cells. The rAV.GFP3ori vims was subsequently purified 
through 3 rounds of CsCl banding as described in Duan et al., 1997. The typical 
yields from this viral preparation were 1012 DNA molecules/ml. 
5 DNA titers were determined by viral DNA slot blot hybridization against 

GFP ^^P-labeled probe with copy number plasmid standards. The absence of 
helper adenovirus was confirmed by histochemical staining of rAAV infected 
293 cells for beta-galactosidase, and no recombinant adenovirus was found in 
10'** particles of purified rAAV stocks. The absence of significant wtAAV 
10 contamination was confirmed by immunocytochemical staining of rAAV/ Ad 
co-infected 293 cells with anti-Rep antibodies. Transfection with pRep/Cap was 
used to confirm the specificity of immunocytochemical staining. No 
immunoreactive Rep staining was observed in 293 cells infected with 1 0*^ rAAV 
particles. 

15 Tsnlation of A A V rirr.nlar Tntermediatfts From Muscle. 

The tibialis anterior muscle of 4-5 week old C57BL/6 mice were infected 
with AV.GFP3ori (3 X 10^** particles) in Hepes buffered saline (30 nl). GFP 
expression was analyzed by direct immunofluorescence of fi^shly excised tissues 
and/or in formalin-fixed cryopreserved tissue sections in four independently 

20 injected muscles harvested at 0, 5, 10, 16, 22 and 80 days post-infection. Tissue 
sections were counter-stained with propidium iodide to identify nuclear DNA. 
Hirt DNA (Hirt, 1967) (20 ml per muscle sample) was isolated from at least 
three independent muscle specimen for each time point and used to transform £. 
coli SURE cells using 3 ml of Hirt with 40 ml of electrocompetent bacterial 

25 (approximately 1 x 10'* cfii/ug DNA, Strategene Inc.). The resultant total number 
of bacterial colonies was quantified for each time point and the abimdance of 
head-to-tail circular intermediates was evaluated for each time point (> 20 
bacterial clones analyzed) by PstI, Asel, SphI, and Pstl/Asel digestion, and 
confmned by Southern blot analysis using ITR, GFP and stuffer probes. The 

30 head-to-tail configuration in typical clones were also confirmed by dideoxy 

sequencing using primers ELI 18 (5'-CGGGGGTCGTTGGGCGGTCA-3'; SEQ 
ID N0:1) and EL230 (5'-GGGCGGAGCCTATGGAAAA-3'; SEQ ID N0:2) 
which are nested to 5' and 3' ITR sequences, respectively. Zero hour controls 
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were generated by mixing 3x10*° particles of AV.GFP3ori with control 
uninfected muscle lysates prior to Hirt DNA preparation. As described in Table 
1 , a number of additional controls for were performed to rule out non-specific 
recombination of linear AAV genomes in bacteria as a source for isolated 
5 circular intermediates. 



Table 1 . Control Experiments for Rescue of Circular Intermediates in Bacteria 



10 


Type of Input 
DNA 


Source of DNA 


Number of 
Molecules 


Number of 
Amp Resistant 
Bacterial 
Colonies 


Presence of 
Head-to-Tail 

Circular 
Intermediates' 




Purified rAW 


Hirt from Infected 
Muscle (22 day) 


3 x 10*' 


approximately 
5x 10^ 


Yes 




Punned rAAV 


Virus 
reconstituted into 
Uninfected 
Muscle Hirt' 


J X lU 


A 
U 


Nn 


15 


Linear ssDNA 
Encompassing 
rAAV 
Genome*" 


Isolated from 
Purified Virus 


3 X 10'^ 


2 


No 


20 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 


Isolated from 
proviral plasmid 
(Hindni/PvuHf 


3x10*** 


3 


No 


25 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 
+ ligase** 


Isolated from 
proviral plasmid 
(Hindlll/PvuII) 


3 x 10'' 


>6x 10^ 


Yes 



* Purified virus was reconstituted into muscle homogenates prior to preparation 
of Hirt DNA. 



30 ^ Viral DNA predominantly contained single stranded genomes as evident by 
Southern blot analysis against with ITR probe. However, small amount of 
dsDNA AAV genomes also existed and are likely due to reannealing of single 
stranded genomes during preparation. Purified viral DNA concentrations were 
determined by OD^^ and 75 ng representing approximately 3x10*' viral 

35 genomes were used for transformation of bacteria. 

^ Hindlll/PvuII digestion was used to remove the entire rAAV genome from 
pcisAV.GFP3ori. Hindlll and PvuII leave 10 and 0 bps of flanking sequence 
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outside the 5' and 3' ITRs, respectively. The hnear dsDNA fragment (4.7 kb) 
was gel isolated following blunting with T4 DNA polymerase and the DNA 
concentration determined by ODj^o- One hundred and fifty ng of linear fragment 
representing approximately 3 x 10*° viral genomes were used for transformation 
5 of bacteria. 

Linear dsDNA viral genomes (Hindlll/PvuII blunted fragment) were treated 
with T4 DNA Ugase prior to transformation of bacteria. 

1 0 ^ The presence of head-to-tail circular AAV intermediates were confirmed by 
restriction enzyme digestion (Asel, PstI, and SphI) and Southern blotting against 
ITR probe. 

Frartinnatinn nf m nsrle Hirt DNA preparations. 

15 Preparative-scale fractionation of the muscle Hirt DNA was performed 

by 1% agarose gel electrophoresis using the Bio-Rad Mini Prep Cell (Catalog 
#170-2908). A 4.5 ml (10.5 cm) tubular gel containing 1 x TBE buffer was 
poured according to manufacturers specification. A total of 20 ml Hirt 
preparation from one entire muscle sample was loaded on top of the gel. 

20 Electrophoresis was carried out at a constant current of 10 mA over a period of 
5 hours. Sample eluent was drawn from the preparative gel apparatus by a 
peristaltic pump at a rate of 100 ml/min and eluted into a fiction collector at 
250 ml/fraction. The collected DNA was subsequently concentrated by standard 
ethanoi precipitation and used to transform SURE bacterial cells by 

25 electroporation as described above. 

In vitrn PersiRtence nf A A V C\m\Ur Intermediates. 

Transgene expression and persistence of AAV circular intermediate 
plasmid clones were evaluated following transient transfection in Hela and 293 
cells. Subconfluent monolayers of Hela cells in 24-well dishes were transfected 

30 with 0,5 mg of either AAV circular intermediates (p8 1 or p87) or pCMVGFP 
using Lipofectamine (Gibco BRL Inc.). The cultures were then incubated for 5 
hours in serum free DMEM followed by incubation in DMEM supplemented 
with 10% fetal bovine serum. All plasmid DNA samples used for transfections 
were spiked with pRSVlacZ (0.5 mg) as an internal control for transfection 

35 efficiency. At 48 hours post-transfection, cells were passaged at a 1 :10 dilution 
and allowed to grow to confluency (day 5), at which time GFP clones were 
quantified for size and abundance using direct fluorescent microscopy. The 
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percent of beta-galactosidase-expressing cells was also quantified at this time 
point by X-gal staining. At 5 days, cells were passaged an additional time (1:15 
dilution) GFP clones were quantified again at day 10. The persistence of 
plasmid DNA at passage-5, 7, and 10 days post-transfection was evaluated by 
5 Southern blot analysis of total cellular DNA using ^^P-labeled GFP probes. To 
determine whether the head-to-tail ITR array within circular intermediates was 
responsible for increases in the persistence of GFP expression, the head-to-tail 
ITR DNA element was subcloned into the pGL3 luciferase plasmid to generate 
pGL3(ITR). The head-to-tail ITR DNA element was isolated firom a monomer 

1 0 circular intermediate (p8 1 ) by Aatll and Haell double digestion and 

subsequently inserted into the Sail site of pGL3 (Promega) by blunt ligation. 
The resultant plasmid pGL3(ITR) contains the luciferase reporter and 
head-to-tail ITR element 3' to the polyA site. The integrity of the ITR DNA 
element within this plasmid was confirmed by sequencing. The persistence of 

15 transgene expression from pGL3(ITR) was compared to that of pGL3 by 
luciferase assays on transiently transfected Hela cells as described above and 
analyzed at 10 days (passage-2). Transfection efficiencies were normalized 
using a dual renilla luciferase reporter vector (pRLS V40, Promega). 
Results 

20 AAV rirmi lar Intermedi ates Represent Stable Ppisomal Forms of Viral DNA 
Associated with T.ong-term Persistence of Transgene Rypression in Muscle. 

To evaluate the molecular characteristics of rAAV genomes in muscle, a 
rAAV shuttle viral vector (AV.GFP3ori) was utihzed which harbors an 
ampicillin resistance gene, bacterial origin of replication, and GFP reporter gene 

25 (Figure 1 A). This recombinant virus was used to evaluate the presence of 

circular intermediates by bacterial rescue of replication competent plasmids. In 
these studies, dehvery of AV.GFP3ori (3 x 1010 particles) to the tibiaHs muscle 
of mice led to GFP transgene expression which peaked at 22 days and remained 
stable for at least 80 days (Figure 4A). These results confirmed previous 

30 successes in rAAV mediated gene transfer to muscle (Kessler et al., 1996; 
Herzog et al, 1997; Xiao et al., 1996; Clark et al., 1997; Fisher et al., 1997). 
The formation of circular intermediates was evaluated by £. coli transformation 
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of Hirt DNA harvested from muscle at 0, 5, 10, 16, 22, and 80 days 
post-infection with AV.GFP3ori. 

In these muscle samples, circular intemiediates were found to have a 
characteristic head-to-tail structure witii 1-2 ITR repeats. The most abundant 
5 form included two inverted ITRs within a circularized genome (Figure 4B, clone 
pi 7). This figure also depicts a less frequent fonn (< 5%) of circular 
intermediates observed, p439, with undetermined structure. When this type of 
repHcation competent plasmid was seen, it was not included in the 
quantification of head-to-tail circular intennediates since its structure could not 

1 0 be conclusively determined. The total abundance of muscle Hirt derived 
head-to-tail circular intermediates (with 1-2 ITRs) demonstrated a 
time-dependent increase that peaked with transgene expression at 22 days and 
slightly decreased by day 80 (Figure 5A). Increased diversity in the length of 
ITR arrays within circular intermediates was seen at longer time points. For 

1 5 example, Figure 5B demonstrates several isolated circular intennediates with 1-3 
ITRs isolated from 80 days muscle Hirt samples. This is in contrast to the more 
uniform structure of circular intennediates with two ITRs in a head-to-tail 
conforaiation at 5-22 days post-infection. 

To evaluate the potential for artifactual rescue of linear rAAV genomes 

20 by recombination in bacteria, several control experiments were performed. First, 
uninfected control muscle Hirt preparations, spiked with an equal amount of 
rAAV virus used for in vivo infection of muscles, failed to give rise to 
rephcating plasmids following transformation of E.coli. Second, when a blunted 
linear double stranded Hindlll/PvuII fragment isolated from pcisAV.GFP3ori 

25 (encompassing the entire rAAV genome) was used to transform bacteria, no 
ampicillin resistant bacterial colonies were obtained. The addition of T4 ligase 
to this firagment, however, led to significant numbers of bacterial colonies. 
Third, when purified single stranded rAAV DNA was used for transformation, 
no bacterial colonies were obtained. As summarized in Table 1, these results 

30 confirm that in the absence of productive infection, rAAV genomes themselves 
are incapable of recombining into replication competent plasmids in bacteria. 
Hence, in vivo circularization of rAAV genomes is a prerequisite for rescuing 
autonomously replicating plasmids in E, coli with this shuttle vector. 
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Mnleriilar weight nf rirriilar mtermediates snggeRt a rnnversion from monomer 
tn miiltimer forms over time. 

To further characterize the circular intermediates isolated from muscles, 
Hirt samples from 22 days and 80 days post-infected muscles were size 
5 fractionated by continuous-flow gel electrophoresis (BioRad). As shown in 
Figure 6 , the majority of circular intermediates at 22 days post-infection size 
fractionated at a molecular weight of less than 3 Kbp. Very few clones were 
isolated from fractions between 3 to 5 kb and no clones were obtained from 
fractions larger than 5 kb at this time point. Furthermore, this size fractionated 

10 molecular weight of ik vivo Hirt derived circular intermediates at 22 day time 
points correlated with that of head-to-tail monomer undigested circular 
intermediate plasmids rescued in bacteria from this same time point 
(approximately 2.5 kb). These data suggest that at early time points 
post-infection in muscle, the predominant form of circular intermediates likely 

1 5 occurs as monomer genomes. The lower mobility of this fraction as compared to 
replication form monomer (Rfin=4,7 kb) and dimer (Rfd=9.4 kb) genomes 
provides indirect evidence that these forms are not responsible for rescued 
plasmids in these Hirt samples. Interestingly, when 80 day muscle Hirt samples 
were size fractionated, more clones were retrieved from higher molecular weight 

20 fractions ranging from 3-12 kb (Figure 6). This shift in the molecular weight of 
circular intermediates indicates the potential for recombination between 
monomer forms in the generation of large circular multimer genomes. Such 
concatamerization has been previously observed in muscle and has traditionally 
been hypothesized to involve linear integrated forms of the AAV genome 

25 (Herzog et al., 1997; Xiao et al., 1996; Clark et al., 1997; Fisher et ah, 1997). 
This data sheds new light on the molecular characteristics of these persistent 
AAV genomes and suggests that they are in fact circular and episomal. Based on 
yields of retrievable circular plasmids reconstituted in Hirt DNA, the efficiency 
of bacterial transformation, and the initial innoculum of virus, we estimate that 

30 approximately 1 in 400 viral DNA particles circularize following infection in 
muscle (Table 2). 
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Table 2. Yield of Circular Intermediate Isolation from Hirt DNA 



Bacterial 
Transformation 


Starting Number 
of Plasmid or 
AAV Genomes 


Actual Number 
of Amp' cfu 


Adjusted Yield 


Hirt DNA from rAAV 
Infected Muscle ^ 


3x 10**^ molecules 


5 X 10^ cfu 


5 X 10^ cfu^ 


Hirt DNA + 230ng 
LacZ Plasmid*"" 


3 X 10*** molecules 


2x10' cfii^ 


2x10^ cfii 


230 ng LacZ Plasmid ^ 


3x 10'^ molecules 


2x 10^ cfu 





^ The actual amount of Hirt used for transformation was 3/20 the entire Hirt 
DNA. The numbers have been adjusted to reflect viral innoculimi and yields for 
the entire muscle. 

15 

^ Plasmid DNA was spiked into mock infected muscle homogenates prior to 
isolation of Hirt DNA. This reconstituted Hirt DNA was then used for 
transformation of bacteria. 

20 The actual microgram amounts of plasmid used in reconstitution experiments 
was 10 ng. The nimibers have been adjusted for comparison to normalize the 
number of plasmids genomes to that used in AAV experiments. Control LacZ 
plasmid was approximately 7000 bp with a molecular weight of 4.6 x 10* 
g/mole. 

25 

The average of several experiments indicates an approximate 100-fold 
reduction in the nimiber of cfii recovered from bacterial transformations with 
DNA isolated from Hirt extract spiked with plasmids as compared to 
transformation with an equivalent amoxmt of plasmid DNA alone. 

30 

^ Adjusted yield indicate approximately 1 in 400 AAV genomes circularize in 
vivo. 

Given the fact that not all rAAV particles likely contain functional DNA 
35 molecules and intermediates may integrate, these calculations may represent an 
underestimation. 

AAV rimilflr Tntermfiriiates D pTnnngtrate Increased Persistence as Plasmid 

Based Vpctors, 

40 Based on the finding that circular AAV intermediates were associated 

with long term persistence of transgene expression in muscle, rAAV circular 
head-to-tail intermediates may be molecular structures of the AAV genome 
associated with the latent life cycle and increased episomal stability. Several 
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aspects of the structure of AAV circular intermediates may account for their 
increased stabihty in vivo. First, circularization of AAV genomes may create a 
nuclease resistant conformation. Secondly, since the only viral sequences 
contained within circular intermediates are the head-to-tail ITR array, these 
5 sequences might bind cellular factors capable of stabilizing these structures in 
vivo. Several studies have demonstrated increased persistence of transgene 
expression with plasmid DNA encoding viral ITRs (Philip et al., 1994; Vieweg 
et al., 1995). The results described above provide a fimctional explanation for 
the increased persistence through the association with circular intermediate 
1 0 formation as part of the AAV life cycle. 

To more closely evaluate the persistence of AAV head-to-tail circular 
intermediates, several in vitro experiments were performed by transfecting these 
intermediates into Hela cells and assessing the stability of plasmid DNA and 
transgene expression by GFP clonal expansion. Results from Hela cell 
1 5 transfection experiments demonstrated that two monomer head-to-tail circular 
intermediates (p81 and p87) studied gave rise to a 10-fold higher number of five 
and ten day transgene-expressing clones, as compared to a control pCMVGFP 
plasmid lacking the ITR sequences (Figures 7A and B). Additionally, the size of 
GFP positive colonies at 5 days post-transfection was three-fold larger in Hela 
20 cells transfected with p81 and p87, as compared to the pCMVGFP control vector 
(Figures 7A and B). These studies suggest the AAV circular intermediates have 
increased stability of transgene expression and substantiate findings in muscle. 

To confirm the increased molecular persistence of head-to-tail circular 
intermediates following transfection into Hela cells, total DNA (low and high 
25 molecular weight) was isolated from cultures of pCMVGFP and p81 transfected 
Hela cells at various passages post-transfection and analyzed by Southern 
blotting. Southem blots hybridized to ^^P-labeled GFP probes demonstrated a 
significantly higher level of p81 plasmid DNA at passage-7 as compared to the 
control vector lacking the head-to-tail ITR sequence (Figure 7C). The majority 
30 of signal in undigested DNA samples was associated with a 4.7 kb band 

migrating at the approximate size of the uncut monomer plasmids. Together 
with the fact that the majority of signal from all cell cultures in Figure 7C 
disappeared by passage- 10, these data suggest that these plasmids predominantly 
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remained episomal. Thus, in both muscle and Hela cells, increased persistence 
of AAV circular intermediates is correlated with stable transgene expression. 
TTR arrayR are responsible for inrreased persistence. 

To investigate whether the head-to-tail ITR DNA element was 
5 responsible for the increased persistence of circular intermediates, we cloned this 
DNA element into a secondary luciferase vector (pGL3) to give rise to 
pGL3{ITR). Transient transfection experiments in Hela cells demonstrated a 
five-fold increase in the persistence of luciferase expression in serially-passaged 
cultures at 10 days in pGL3(ITR) as compared to that of pGL3 transfected 

10 (Figure 7D). These findings support the hypothesis that the head-to-tail ITR 
DNA element contained within circular intemiediates is responsible for 
mediating the increased persistence of transgene expression and suggest a 
mechanism by which these molecular intermediates may confer stabihty to AAV 
genomes in vivo. Furthermore, increases in the stability of transgene expression 

1 5 conferred by this element appear to be primarily context independent, since the 
head-to-tail ITR element was 3' to the luciferase gene in pGL3(ITR) and 5' to 
the GFP transgene in AAV circular intermediates. 
Discussion 

Characterization of integrated proviral structures in different cell lines 
20 has demonstrated head-to-tail genomes as the predominant structural foms for 
both wild type and recombinant AAV (McLaughUn et al., 1988; Cheung et al, 
1980; Duan et al., 1997). This is in contrast to the head-to-head and tail-to-tail 
structures observed in AAV replication intranediates (Rfin and Rfd). Both Rfin 
and Rfd configurations have also been demonstrated in rAAV infected cells and 
25 enhanced conversion of ssAAV genomes to double stranded Rfin and Rfd forms 
has been suggested as a mechanism for augmentation of rAAV transduction by 
adenovirus in cell lines (Ferrari et al, 1996; Fisher et al, 1996). However, it is 
plausible that the mechanisms responsible for the fomiation of Rfin and Rfd 
molecules are different from pathways which lead to long-term transgene 
30 expression. In support of this hypothesis is a recent study evaluating 

augmentation of rAAV transgene expression by adenovirus in liver (Snyder et 
al., 1997). These studies have demonstrated that co-infection of the liver with 
adenovirus and rAAV enhances short term transgene expression while long term 
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expression was no different than rAAV alone. The exact mechanism for the 
formation of head-to-tail circular intermediates is not clear, however similar 
structures have been demonstrated to act as pre-integration intermediates for 
retrovirus (V annus, 1982). In this regard, circularized retroviral genomes with 
5 one and two viral LTRs have been proposed. In addition, circular pre-integration 
intermediates have also been suggested by recent studies on wtAAV integration 
(Linden et al., 1996b). The demonstration that circular intermediates exist in 
rAAV infected muscle explains several features of latent phase infection with 
rAAV vectors including proviral structure and stable episomal persistence. 

1 0 Previous studies have suggested that rAAV genomes delivered to muscle 

might persist as head-to-tail concatamers (Herzog et al., 1997; Clark et al., 1997; 
Fisher et al., 1997). However, it is currently unknown whether these 
concatamers exist as free episomes or as integrated proviruses in the host 
genome. The results described above, i.e., demonstrating prolonged persistence 

1 5 of head-to-tail circular intennediates at 80 days post-infection, suggest that a 
large percentage of rAAV genomes may remain episomal. The conversion of 
monomer circularized genomes to larger circularized multimers appears to be an 
aspect associated with long term persistence and likely represents 
recombinational events between monomer intermediates. Although the bacterial 

20 rescue strategy was not capable of satisfactorily addressing the size of multimers, 
our modified approach to size fractionating Hirt DNA prior to bacterial rescue of 
intermediates lends support to this hypothesis. Additional supportive evidence 
for increased recombination over time is the finding that greater variability in the 
length of ITR arrays was observed at longer time points post-infection. For 

25 example, at 5-22 days the majority of circular intermediates contained 2 ITRs in 
a head-to-tail fashion: This is in contrast to 80 day time points where the lengths 
of ITR arrays ranges from 1-3 ITRs. Such diversity of ITR arrays in muscle 
infected with AAV has been previously found using PCR approaches (Herzog et 
al, 1997; Fisher et al, 1997). In addition, the 30% decline in the abundance of 

30 circular intermediates in muscle between 22 and 80 days also supports a 

hypothesis that these molecular fomis of AAV may represent pre-integration 
complexes. 
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Given the fact that circular intermediates had long term persistence in 
muscle, certain structural features of these intermediates may affect episomal 
stability of DNA. Previous studies have noted increased persistence of transgene 
expression from plasmids encoding AAV ITRs (Philip et al., 1994; Vieweg et 

5 al., 1995). However, the physiologic significance of this finding has remained 
elusive. The present study, demonstrating the head-to-tail ITR arrays isolated 
from AAV circular interaiediates can confer increased episomal persistence to 
plasmids following transfection in cell lines, gives a mechanistic firework for 
ITR effects on plasmid persistence. Furthermore, the correlation that AAV 

10 circular intermediates have increased persistence in cell lines in vitro, lends 
support to the hypothesis that these structures represent stable episomal forms 
following rAAV transduction in muscle. Stability of circular intermediates in 
vivo might be mediated by the binding of cellular factors to "HoUiday-like" 
jimctions in ITR arrays which stabilize or protect DNA from degradation. 

1 5 rAAV has been shown to be an efficient vector for expressing transgenes 

in various tissues in addition to muscle, such as brain, retina, liver, lung, and 
hematopoetic cells (Snyder et al., 1997; Muzyczka, 1992; Kaplitt et al., 1994; 
Walsh et al., 1994; Halbert et al., 1997; Koeberl et al., 1997; Conrad et al., 1996; 
Bennett et al., 1997; Flannery et al., 1997). Despite these advances in the 

20 application of rAAV, the mechanisms of in vivo rAAV-mediated transduction 
and persistence of transgene expression still remain imclear. Such questions as 
to the molecular state of rAAV following in vivo delivery is highly relevant to 
the clinical application of this viral vector. For example, should rAAV primarily 
persist as an randomly integrated provirus, the potential for insertional 

25 mutagenesis could present a major theoretical obstacle in the use of this vector 
due to the potential for mutational oncogenesis. The demonstration that rAAV 
can persist as episomes suggest that random integration and associated risks of 
malignancy may not be a major concern for this viral vector system. 
Additionally, the molecular determinants of AAV circular intermediates 

30 associated with increased persistence in cell lines appear to be contained within 
the DNA elements encompassing the inverted ITRs. The isolation of this 
naturally occurring viral DNA element, which forms as part of the AAV life 
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cycle and acts to stabilize circular episomal DNA, may prove useful in 
increasing the efficacy of both viral and non-viral gene therapy vectors. 

Fvample 3 

FviHfinr.fi for Tncreased Rpisomal PerRiste nrp nf AAV Circular Intermediates in a 
5 Mndfi] for in utPrn Plasmid-Rased Gene Therapy 

Persistence of AAV circular intermediates were assessed by injection of 
plasmid DNA directly into the pronucleus of fertilized ^e/iopu^ oocytes. 
Twenty-five ng of the p81 isolate of AAV circular intermediates was injected at 
the single cell stage of fertilized A^ewopw^ oocytes. This plasmid was compared 

10 to the proviral plasmid pCisAV.GFP3ori, which contains two ITRs separated by 
stuffer sequence in an alternative confirmation to ITRs in p81. Figure 13 depicts 
the persistence of GFP plasmids as assessed by direct fluorescence of GFP. At 
this state of tadpole development, the fertilized oocyte has expanded firom a 
single cell to approximately 10^ cells. 

1 5 These studies confirm that AAV circular intermediates (p8 1 ) confer a 

higher level of stability in development Xenopus oocytes than plasmids 
containing similar transcriptional elements and ITR sequences in an alternative 
confirmation. Given that in the case of p81 injected oocytes, tadpoles are 
completely fluorescent, the data suggests that some level of integration may have 

20 occurred. 

Fyample 4 

T.ipnsnme Mediated Transfer of Vfirtnrg nf the Invention to the Airwav and 

Muscle 

Studies evaluating the mechanisms of recombinant adeno-associated 
25 virus (AAV) transduction have identified a novel molecular intermediate 
responsible for episomal persistence. This intermediate is characterized by a 
circularized AAV genome with head-to-tail ITR repeats. Circular intermediates 
of rAAV were identified using a recombinant shuttle vector capable of 
propagating circularized viral genomes in bacteria. Pivotal experiments in cell 
30 lines demonstrate diat the formation and persistence of these circular 
intermediates are augmented in the presence of helper adenovirus. These 
findings suggest that cellular factors induced by adenoviral gene expression may 
modulate both the formation and/or persistence of AAV circular intermediates. 
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Furthennore, studies in muscle have demonstrated that following rAAV 
infection, the formation and persistence of AAV circular intermediates correlates 
with the onset and maintenance (at 80 days) of transgene expression, 
respectively. Moreover, a 300 bp fragment encompassing the head-to-tail 
5 inverted ITR repeats found in AAV circular intermediates when cloned into 
heterologous expression plasmids can confer increased stability to those 
plasmids in HeLa cells. The structural aspects of AAV circular intermediates 
may lead to development of non-viral, plasmid based, gene transfer vectors with 
increased persistence of transgene expression. 

1 0 To determine whether AAV circular intermediates which differ in length 

and/or sequence of the ITR array are more efficacious plasmid based vectors for 
liposome-mediated gene transfer to the airway and muscle, several distinct forms 
of AAV circular intermediates are evaluated as plasmid-based delivery systems 
in three model systems of the airway including: 1) in vitro polarized primary 

15 airway epithelial monolayers, 2) mouse lung, and 3) human bronchial 

xenografts. Persistence is evaluated at both the level of transgene expression 
(using GFP and luciferase reporters) and at the level of episomal and integrated 
transgene derived DNA. Studies are perforaied to assess whether integration can 
be specifically enhanced by co-transfection with Rep DNA or mRNA. These 

20 studies also evaluate both the extent of integration and site specificity to AVSl 
sites in chromosome 19 of human model systems. 

Gene therapy using plasmid-based dehvery systems have encountered 
several obstacles to efficient transgene expression. These obstacles include 
transient expression of transgenes and rapid degradation of DNA. In contrast, 

25 viruses have developed efficient mechanisms for transducing cells and 

expressing encoded viral genes. The molecular characteristics of AAV circular 
intermediates which confer increased persistence of transgene expression include 
a DNA element encompassing the head-to-tail ITR. Based on the findings that 
circular intermediates have increased episomal persistence in muscle following 

30 rAAV transduction, these structures may also have increased persistence as 
plasmid-based vehicles to the airway. Interestingly, several naturally occurring 
mutations which are found in approximately 50% of AAV circular intermediates 
affect the stability of the intermediate. 
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Several findings evaluating the efficiency of AAV circular intermediate 
formation from recombinant viral vectors have suggested that these structures 
are augmented in abundance by the presence of the E2a adenoviral gene product. 
These molecular structures may represent preintegration intermediates which, in 

5 the case of wild-type AAV, would efficiently integrate into the cellular genome 
by Rep facilitated mechanisms. However, in the case of recombinant AAV 
genomes (in the absence of Rep proteins), evidence suggests that these structures 
have increased episomal stability. To test whether exogenous addition of Rep 
and/or E2a can increase the efficacy of AAV circular intermediates by 

1 0 modulating their stability and/or integration, co-transfection methods with Rep 
encoding plasmids and mRNA are conducted. Additionally, exogenously 
supplied E2a DNA binding protein (DBP) may also enhance stability of AAV 
circular intermediates. Rep may increase the integration of circular 
intermediates while E2a may increase their episomal stability. Several 

15 observations including the association of E2a DBP with AAV genomes in the 
nucleus support a direct interaction between DBP and AAV circular 
mtermediates. Furthermore, if DBP associates with AAV circular intermediates, 
its encoded nuclear localization sequence (NLS) may enhance nuclear 
sequestration of these plasmids in the nucleus. Alternatively, E2a may act to 

20 alter the persistence of AAV circular intermediates through the induction of 
cellular factors which interact with the ITR array. 

Liposome mediated gene transfer to the airway has considerable 
advantages due to the low level of toxicity. However, limitations include 
transient low level expression in differentiated airway epithelia. Despite this 

25 apparent limitation, several laboratories have had considerable success with the 
use of cationic liposoine-mediated gene transfer in several animal models 
including mouse and rat lung, and numerous laboratories have pursued clinical 
trials, which suggested that these vehicles may show promise for gene therapy of 
the cystic fibrosis (CF) lung. Thus, delivery of the present vectors in plasmid 

30 form via liposomes may be a safe and effective vehicle for gene transfer to the 
airway. 

To assess whether AAV circular intermediates may also have increased 
persistence in airway epithelial cells as seen in Hela cells, several distinct forms 
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of circular intermediates delivered by liposome-mediated transfection into 
primary airway epithelial cells, are evaluated. Based on the diversity of ITR 
repeat elements between various isolated circular intermediates (i.e., including 0, 
1, 2, and 3 ITRs), circular intermediates isolated fix)m later time points in muscle 
5 may have been naturally selected for increased stability in vivo. Hence, the 
structural consistencies between AAV circular intermediates are identified which 
give increased persistence as plasmid based vectors for gene transfer. 

Circular intermediates containing the GFP reporter gene and 1, 2, and 3 
ITRs are transfected into primary airway cultures and polarized epithelial cell 

1 0 monolayers using the cationic lipid GL-67 (Genzyme Inc.). DNA to lipid ratios 
are optimized using a luciferase reporter. Additionally, the addition of EGTA, or 
the use of calcium-free media, can increase the extent of gene transfer about 10- 
fold, and may be included to enhance gene transfer to polarized epithelial 
monolayers. To evaluate persistence and expression of transgenes from circular 

1 5 intermediates, direct fluorescent microscopy and Southern blotting of both Hirt 
and genomic DNA with GFP P^^-labeled probes are utilized. Proliferating 
cultures of primary airway epithehal cells can be passaged up to 4 times during 
this analysis. In contrast, polarized epithelial monolayers are evaluated at 1 
week intervals for DNA persistence for up to 6 weeks. Since GFP transgene 

20 expression may be low and difficult to detect by direct fluorescence, GFP is 
quantitated by fluorometer of cell lysates. 

Following AAV transduction, circular intermediates may form within 
cells and certain structures of these intermediates may persist by virtue of 
affinity for cellular factors which bind at ITR arrays. If this is true, then it may 

25 be possible to select for and isolate optimal circular intermediates with increased 
persistence in airway cells by batch screening of circular intermediates pools 
from rAAV infected airway epithelia. 

Primary airway epithelia cell cultures are infected with AV. GFPSori 
(MOIs of 1 000 to 1 0,000 DNA part/cell) and low molecular Hirt DNA is 

30 prepared at 5-1 5 days post-infection. Hirt DNA containing circular 

intermediates from rAAV infected cells is used to then transfect primary airway 
epithehal cells from which Hirt DNA is prepared at 5-15 days post-transfection. 
This second Hirt isolation is then used to isolate replication competent plasmids 
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following transformation into bacteria. This selection process may give rise to 
those populations of circular intermediates with increased episomal persistence 
in airway epithelial cells. Selected clones of circular intennediate plasmids 
isolated by this procedure are then tested individually for increased persistence 
5 following liposome mediated transfection. These studies are performed in a 
batch type screening in 24 well plates using two serial passages for persistence. 
Once plasmids having increased persistence are isolated, their structure and 
sequence of ITR arrays are characterized. Since screening is performed on 
small-scale cultures, it may be necessary to implement semi-quantitative 

10 screening for DNA persistence within the first round of transfection using PCR 
methods. Candidate plasmids with a high level of increased persistence as 
compared to control plasmids which lack ITR sequences but contain the identical 
promoter-reporter element, are evaluated on a larger scale transfection amenable 
to analysis by Southern blotting of total DNA. 

1 5 To evaluate selected circular interaiediate structures in vivo, two models 

including mouse lung and the human bronchial xenograft are employed. 10 wk 
BalbC mice are transfected with GL-67/DNA complexes at a ratio of 25 ^ig 
plasmid/25 \ig lipid in an iso-osmotic solution of Dextrose. At 1, 5, 10, 15, and 
20 days post-transfection lungs of mice are harvested for immunofluorescent 

20 detection of GFP in formalin fixed sections and for quantitative fluorometry of 
tissue lysates. Southern blots are employed to evaluate the persistence of 
plasmids in Hirt and genomic DNA. In addition to evaluating the persistence of 
selected circular intermediates which have the highest level of persistence with 
in vitro models, luciferase constructs are evaluated in which the ITR array has 

25 been cloned either 5 ' or 3 ' to the reporter gene. Furthermore, the use of 

luciferase reporters allows for more sensitive assessment of transgene activity in 
cell lysates. 

Similarly, //2 vivo persistence of transfected circular intermediates and 
heterologous plasmids containing ITR arrays found within circular intermediates 
30 is evaluated in human bronchial xenografts. 

Findings evaluating the effects of adenoviral co-infection on circular 
intermediate formation and persistence have suggested that E2a DBF leads to a 
10-fold increase in the abundance of circular intermediates as compared to E2 
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deleted virus. Furthermore, studies with El -deleted virus have demonstrated that 
the persistence of circular intermediates in Hela cells is increased at 72 hours 
post-infection. These studies suggest that E2a DBP may augment circular 
intermediate formation and/or increase the stability of these structures by an 
5 unknown mechanism. E2a DBP may interact directly with circularized genomes 
and/or induce cellular factors which interact with sequences in these AAV 
genomes. Since DBP encodes an NLS, this protein may act to shuttle circular 
intermediates to regions of nucleus that allow for increased stability of these 
stmctures. NLS sequences have been shown to cooperatively interact with 

1 0 nucleolar targeting sequences and hence we will also evaluate if subnuclear 
targeting is important in maintaining the increased stability of circular 
intermediates containing ITR arrays. Furthermore, it is currently unknown 
where circular intermediates form in the cell and it remains plausible that they 
may form in the cytoplasm or nucleus. Hence, if DBP associates directly with 

1 5 circular intermediates, it may act as an NLS for DNA to enter the nucleus as 
well. 

Several in vitro reconstitution models are used to investigate the 
interaction of circular intermediates with DBP and their affect on in vivo 
persistence following DNA transfection in Hela cells. Furthermore, results 

20 evaluating the affects of various mutant adenoviral vectors on circular 

intermediate and Rfin/Rfd formation have suggested that these two types of 
intermediates occur by independent pathways indicative of latent and lytic 
infection, respectively. In the setting of wild type AAV, circular intermediates 
may be pre-integration complexes, which in the presence of Rep, efficiently 

25 integrate into the host genome. In contrast, in the absence of Rep, circular 
intermediates may accumulate episomally in rAAV infected cells. To this end, 
methods of supplementing Rep function may be capable of enhancing integration 
of plasmid based delivery of AAV circular intermediates. Thus, experiments in 
which co-transfection of circular intermediate plasmids with Rep expression 

30 plasmids or mRNA are conducted. 

To investigate whether DBP can augment the stability of circular 
intermediates by increasing targeting to the nucleus, a Hela cell line (gmDBP6) 
is utihzed which encodes an inducible E2a gene under a dexamethasone 
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responsible element. This cell line gives rise to high levels of DBP in nuclear 
extracts by Western blot following treatment with dexamethasone. gmDBP6 
ceils (+/- DEX) are transfected with various AAV circular intermediate plasmids 
containing 0, 1, 2, and 3 ITRs and total cellular and nuclear plasmid content 
5 evaluated by subcellular fractionation using Southern blotting against GFP 
probes. The time course of these studies is initially within the range of 12 hours 
to 4 days post-transfection. Transgene expression is evaluated by fluorometry 
(in cell lysates), and fluorescent microscopy (in viable cells), for GFP and 
luminescence for luciferase. Hela cells have demonstrated that immediate 
1 0 increases in transgene expression from AAV GFP circular intermediates as 
compared to control GFP plasmids occur as early as 24 hours post-transfection. 
Thus, certain cellular factors may facilitate an immediate accumulation of 
circular intermediates in the nucleus. DBP may invoke this increase by either 
direct interactions with ITR sequences or by the induction of cellular factors. To 
15 evaluate the potential for direct interactions between DBP and circular 

intermediates, various form of ITR arrays found within circular intermediates are 
end-labeled with y-ATP^^ and evaluated for binding by electrophoretic mobility 
shift assays to nuclear extracts from gmDBP6 cells (+/- DEX). Supershifts, with 
DBP antibodies and competition experiments with cold FTR sequences and non- 
20 specific DNA, are used as controls for specific binding. 

In a second model system aimed at evaluating the potential of DBP for 
shutilmg and/or sequestering of circular intermediates to the nucleus, 
microinjection experiments in oocytes are performed with 50 ng of plasmid 
DNA of circular intermediates with and without 50 ng of DBP mRNA. 
25 Expenments initially evaluate the time course of GFP transgene expression (+/- 
DBP cRNA) by direct fluorescent microscopy. If major differences are seen, 
quantitative fluorometry of individual whole oocytes in 96 well plates is 
conducted. Similar studies on nuclear targeting in the presence of DBP can also 
be e\ aluaied in this model by pooling microinjected oocytes for nuclear isolation 
30 and Southern blot analysis. 

A third experimental model to evaluate nuclear targeting and/or 
accumulation of circular intemiediate vectors in the presence and absence of 
DBP involves the microinjection of fluorescently labeled plasmid DNA into the 
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cytoplasm and real time imaging to follow the nuclear accumulation of DNA. 
The DNA fluorescent dye, TOTO-1, is used to label DNA prior to injection. 
This dye forms an extremely stable complex with negligible diffusion and re- 
incorporation into nuclear DNA following transfection into polarized airway 
5 epithelial cell monolayers. Co-localization of DBP with wtAAV DNA genomes 
at focal hot spots within the nucleus supports the observation that nucleolar 
targeting may be important for persistence. These experiments are also 
performed in primary airway epithelial cells and in vivo models of the airway by 
either co-transfection of circular intermediates with DBP expressing plasmids 

10 and/or mRN A. 

The effects of Rep co-transfection on the integration of circular 
intermediate plasmids is also evaluated. Two methods are used to express Rep 
including: 1) co-transfection with Rep expressing plasmids, and 2) co- 
transfection with Rep encoding mRNA. Initially, Hela, CFTl, and IB-3 cells are 

1 5 tested, as transformed cells may be more amenable to expansion and evaluation 
of integration. Both CFTl and IB-3 cells represent airway epithelial cells. 
Experiments are performed by cationic liposome (GL-67) mediated transfection 
of circular intermediate DNA with varying doses of a Rep-containing expression 
vector, e.g., pCMVRep. The extent of integration is also evaluated by two 

20 criteria. Southern blotting of Hirt and genomic DNA and clonal expansion of 
GFP expressing cells. Since Southern blot has an approximate limit of 
sensitivity of 1 integrated plasmid molecule per 10 cellular genomes, clonal 
expansion may be necessary to evaluate persistence in less transfectable cells 
such as CFTl and IB-3 cells. Cell Hnes are evaluated over the course of 1-10 

25 passages. 

Sustained expression of Rep by plasmid mediated co-transfection may be 
toxic to cells, hence co-transfection with Rep mRNA is also evaluated. Cationic 
liposomermRNA mediated transfection has been previously shown to work in 
cell lines and although the level of expression is much more transient than for 
30 DNA, in these studies it may be an advantage. Initial studies are performed with 
in vitro transcribed Rep mRNA alone to evaluate the ^ig amount of mRNA 
needed for Rep expression as determined by Western blot. Once the threshold 
for detectable Rep expression is established, increasing amounts of Rep mRNA 
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are co-transfected with circular intermediate DNA. Similar assays are used as 
described above to evaluate the extent of AAV circular intermediate integration. 
If findings suggest that increased integration if facilitated by Rep, the site 
specificity of this integration can be evaluated by cloning GFP expressing cells 
5 after the 10th passage by serial dilution. These GFP expressing clones are 
expanded and genomic Southem blots assessed with both GFP and AVSl 
specific probes. By evaluating a number of restriction enzymes which either do 
not cut or cut once within the circular intermediate plasmid, it will be determined 
whether integration has occurred at the AVSl loci. 

1 0 To test whether secondary structure rather than primary sequence is the 

important determinant of increased episomal stability of AAV circular 
intermediates, synthetic DNA sequences are generated with identical secondary 
structure to several ITR arrays in circular intermediates. The primary sequence 
is completely altered and bares no resemblance to sequences contained within 

1 5 native AAV ITRs. These synthetic DNA sequences are tested for their ability to 
confer increased episomal stability to heterologous plasmids in several model 
systems including: 1) the airway, 2) muscle, 3) and developing Xenopus 
embryos. The developing A'e/icpu^ embryo model is ideal for testing integration 
and persistence of plasmid based vectors for application of in utero gene therapy. 

20 If synthetic DNA sequences with similar secondary structure to IFRs are found 
to confer increased persistence to plasmid based vectors, then determinants for 
protein binding which facilitate persistence are independent of primary base 
sequence. These studies allow the optimization of the secondary structural 
requirements by synthesizing a wide range of DNA molecules with varying 

25 degrees of palindromic repeats. Furthemiore, the secondary structure may not 
bind proteins directly but faciUtate recombination of plasmids to large 
concatamers which have increased episomal stability or enhanced integration 
efficiencies. 

Example 5 

30 Delivery of Multiple Ge r\es, through Tntermnlecular Concatamerization 

Methods 

Recnmhtnant AAV vectors, 
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Two rAAV vector stocks were generated for use in these studies, 
AV.GFPSori (Example 1) and AV.Alkphos (also known as CWRAPSP, a gift of 
Dusty Miller) (Halbert et al., 1997). Virus stocks were generated by co- 
transfection of 293 cells with either pCisAV.GFP3ori orpCWRAPSP along with 
5 pRep/Cap, followed by co-infection with recombinant Ad.CMVlacZ helper virus 
(Example 2). rAAV was then purified through three roimds of CsCl density 
gradient centrifiigation as previously described by Duan et al. (1997). Purified 
viral fractions were heated at 60T for 1 hour to inactivate any residual 
contaminating helper adenovirus. The yields for AV.GFP3ori and AV.Alkphos 

10 were 1 x 10^^ and 7 x 10*^ particles per ml, respectively, as determined by slot 
blot hybridization with ^^P-labeled GFP or Alkphos probes. Infectious titers 
detemiined by infection of 293 cells with rAAVs were 1.1x10^ lU/ml 
(AV.GFP3ori) or 8.6 x lO^IU/ml (AVAlkphos), Controls testing for 
contamination of rAAV stocks with wtAAV by anti-Rep immunocytochemical 

1 5 staining in rAAV/Ad.CMVlacZ co-infected 293 cells were negative (limit of 
sensitivity is less than 1 infectious wtAAV particle per 10*^ DNA particles of 
rAAV). Similarly, histochemical staining for p-galactosidase in rAAV infected 
293 cells showed no detectable contamination with helper adenovirus in 10'^ 
DNA particles of rAAV (limit of sensitivity). 

20 Infection of mnsrle tissue and evah iatinn of transgene expression. 

The C57BL/6 mice used for these experiments were housed in a virus- 
free animal care facility and were maintained under strict University of Iowa and 
NIH guidelines, using a protocol approved by the Animal Care and Use 
Committee and facihty veterinarians. Four to five week old mice received 

25 bilateral 30 ^1 injections of a mixture of both AV.GFP3ori and AVAlkphos into 
the tibialis anterior muscle (5 x lO' DNA particles of each virus per muscle). 
Controls included iminjected muscles and muscles receiving injections of one of 
the viruses alone. At 14, 35, 80, and 120 days post-infection, animals were 
euthanized and tissues were harvested for evaluation of transgene expression and 

30 preparation of low molecular weight Hfrt DNA. For each experimental time 
point, at least 3 independently injected muscles were evaluated. 

In all experiments, GFP fluorescence was visualized in fleshly excised 
muscle tissue prior to processing. A portion of the same muscle was fixed with 
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2% paraformaldehyde in phosphate buffered saline, and cryoprotected in graded 
sucrose solutions before embedding in optimal cutting temperature medium 
(OCT). Sections (6 fim) were then evaluated for GFP expression directly and 
Alkphos expression following heat inactivation of endogenous Alkphos and 

5 histochemical staining for Alkphos activity (Engelhardt et al, 1995), To confirm 
dual localization of GFP and Alkphos expression in the same muscle fibers, 
either serial sections were evaluated for GFP and Alkphos expression or the 
same section was first photographed for GFP expression followed by 
histochemical staining for Alkphos and re-imaging of the same field. 

10 Rescue nf circular intenn fiHi^^tfts frnm muscle Hirt DNA. 

Low molecular weight Hirt DNA was prepared fi-om 20 mg specimens of 
injected muscles fi-om 3 animals at each time point (Example 2). Hirt DNA 
(4 |il; 1/5 of the total volume) was then used to transform 50 ^1 of 
electrocompetent SURE cells (Stratagene) using a BioRad £. coli electroporater 

15 and 0.1 ^m cuvettes. Colonies resulting firom each bacterial transformation were 
quantified, and plasmids firom 20 colonies firom each muscle Hirt DNA sample 
were purified for analysis. It should be noted that only circular forms carrying 
the Amp resistance gene and the bacterial origin of replication firom AV.GFP3ori 
are rescued by bacterial transformation (Duan et al., 1998). Control experiments 

20 reconstituting 5x10^** viral DNA particles into uninfected muscle extracts prior 
to Hirt DNA preparation failed to give rise to replication competent plasmids in 
the rescue assay (Duan et al, 1998). Additional controls in Duan et al. (1998) 
using AV.GFPSori virus also demonstrated that linear double stranded and single 
stranded purified viral DNA genomes do not give rise to replication competent 

25 plasmids following transformation into E coli, 

rharacterizfltmn of encnded genes in rescued circular intermediates. 

Several assays were used to characterize the extent of intermolecular 
recombination between independent circular viral genomes by evaluating the 
number and type of encoded genes in rescued plasmids fi-om Hirt DNA of 

30 muscles co-infected with AV.GFPBori and AV. Alkphos. Initial analysis 
involved the bulk evaluation of 60 rescued plasmids (20 fi-om each of three 
muscle samples for each time point) by dot blot hybridization of mini-prep DNA 
with EGFP, Alkphos, and Amp ^^P-labeled DNA probes. In these studies, Amp 
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hybridization served as a control to show that there was a sufficient quantity of 
DNA for the analysis. The percentages of Alkphos and/or GFP hybridizing 
plasmids were calculated by this method for each muscle sample. From this 
percentage, the total number of plasmids hybridizing to each probe in the Hirt 

5 DNA sample was calculated from the total CPU obtained in each transforation. 
In this analysis, each muscle sample was evaluated independently to detemiine 
the mean (+/-SEM) total Alkphos and/or GFP hybridizing plasmids. A second 
evaluation involved the transfection of rescued plasmids into 293 cells using 
lipofectamine, followed by evaluation of GFP fluorescence and histochemical 

1 0 staining for Alkphos. To confirm that GFP and Alkphos co-expressing plasmids 
were indeed clonal and that both genes were encoded on the same plasmid, a 
selected group of five co-expressing plasmids were retransfomied into Exoli and 
colonies were re-isolated prior to repeating the transfection studies. In all cases, 
plasmids co-expressing the two reporter genes remained clonal through this 

1 5 subsequent re-isolation. 

!^tnirtiira1 anfllysig nfcnnca tamer r AAV drcular intermediates. 

To ftuther characterize the nature of isolated circular intermediates co- 
expressing both GFP and Alkphos transgenes, plasmid structure was mapped by 
Southern blotting and restriction enzyme analysis. The structural of five co- 

20 expressing circular intermediate plasmids were detemiined by digestion with 
Ahdl, Hindin, NotI, Hindlll/NotI, Clal/Asel, and/or SnaBI and Southem 
blotting was performed with ^^P-labeled GFP, Alkphos, and ITR probes. 
Results 

Strategy for chara c tftn7mg mechanisms of rA AV circular intcnned i ate 

25 formation. 

Efficient circularization of rAAV genomes has been previously 
demonstrated to occur in muscle in a time dependent fashion (Example 2). 
Furthermore, the conversion of monomeric to multimeric circular rAAV 
intermediates occxm-ed over time and was associated with long-tenn episomal 

30 persistence of AAV genomes. High molecular weight AAV circular genomes 
might form by either of the following two mechanisms, one involving the 
replication of monomer structures and the other through intermolecular 
recombination between independent monomers. A rescue assay was developed 
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using two separate rAAV vectors, AV.GFP3ori and AV.Alkphos (Figure 14 A), 
which allowed for the identification of independent viral genomes through 
unique transgenes. In this assay, circular form genomes were rescued in bacteria 
by virtue of Amp/ori sequences encoded in one of the two vectors 
5 (AV.GFP3ori). A method for characterizing the extent of interaiolecular 
recombination between independent circular rAAV genomes was shown in 
Figure 14B. 

rn-fiYprpgginn of iTidfipendent l y Pinrndad rAAV transpenftS in muscle mvofibers . 
To confirm that myofibers can be co-infected at a high efficiency with 

1 0 the two rAAV vectors, the tibiahs anterior muscle of mice was co-infected with 
5x10^ DNA particles of both AV.GFP3ori and AV.Alkphos. At 14, 35, 80, and 
120 days post-infection, muscles were harvested and analyzed for transgene 
expression. Transgene expression fi-om both reporters was weak but clearly 
visible in 14 day muscle samples. By 80 days post-infection, transgene 

15 expression was maximal and serial sections demonstrated expression of both 
AJkphos and GFP transgenes in overlapping regions of the muscle (Figures 15A- 
C). At this time point, approximately 50% of the fibers in the tibialis muscle 
expressed both transgenes. To confirm that co-infection of myofibers occurred 
with the two independent vectors, co-localization studies were performed on 

20 muscle sections by a serial staining procedure. These studies, depicted in Figure 
15D, demonstrate four classes of myofiber transgene expression: 1) GFP positive 
only, 2) Alkphos positive only, 3) GFP/Alkphos positive, and 4) no transgene 
expression. The largest fi-action of myofibers expressed both GFP and Alkphos 
transgenes. These results confirm that at the titers of virus used for infection, co- 

25 infection occurred in greater than 90% of transgene expressing myofibers. 
l^ftj^riift of hufimntinnal rAAV rirr.nlar in termediates increases overtime. 

To determine the extent of recombination between circular AAV 
genomes, circular form genomes were rescued as plasmids fi^om low molecular 
weight Hirt DNA of muscle tissue co-infected with AV.GFP3ori and 

30 AV.Alkphos. Following transformation ofExoli Sure cells with Hirt DNA 
purified firom infected muscles, the total number of GFP and Alkphos 
hybridizing Amp resistant bacterial plasmids was quantitated for each time point 
post-infection (Figure 16A and B) (Duan et al., 1995), the abundance of circular 
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AAV genomes rescued from AV.GFP3ori increased over time. For each muscle 
sample (three for each time point) twenty plasmid clones were evaluated for 
hybridization to GFP and Alkphos DNA probes and the total number of plasmids 
was back calculated from the total CPU for each individual muscle sample. 

5 Figure 1 6B demonstrates the mean (+/-SEM, N=3) total plasmids that hybridized 
to GFP or GFP/Alkphos probes at each time point. At 14 days post-infection, 
GFP/ Alkphos co-hybridizing plasmids were never observed. In contrast, at time 
points after 35 days the percentage of GFP/Alkphos co-hybridizing plasmids 
increased with time and reached 33% by 120 days (Figure 16C). Since bacterial 

1 0 plasmid rescue can only occur through AV.GFP3ori genomes, this data suggests 
that recombination between independent Alkphos and GFP rAAV genomes takes 
place over time. These results are consistent with studies described hereinabove 
demonstrating a time dependent concatamerization of monomer circular rAAV 
genomes in muscle. 

15 To evaluate the ability of circular intermediates to express encoded 

transgenes, transient transfection studies were performed in 293 cells with 
rescued circular intermediate plasmids (Figures 17A-C). Between 85-90% of 
rescued plasmids hybridizing to GFP probes on slot blots also expressed the GFP 
transgene in this transfection assay (Figure 17D), The percentage of GFP 

20 expressing plasmids that also expressed Alkphos rose over time in concordance 
with the hybridization data (Figure 17D). However, approximately 40-50% of 
plasmids which were hybridization positive for Alkphos did not express the 
Alkphos transgene. This may represent recombinational deletion of the RSV 
promoter drivmg Alkphos expression which occurred during concatamerization 

25 at sites near the 5' FTR. These results demonstrate that intermolecular 

recombination between Alkphos and GFP derived circular intennediates occurs 
as part of the time dependent concatamerization process of rAAV in muscle. To 
confirm that amplified plasmids stocks expressing both reporter genes were 
actually clonal (i.e., one plasmid rather than two independent plasmids resulting 

30 from contamination), a select number of bacterial clones expressing both 
transgenes were re-isolated and the transfection assays were repeated. In all 
cases, plasmids expressing the two reporter genes remained clonal through two 
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rounds of bacterial cloning. Hence, dual reporter expression was not due to 
contamination of independent GFP and Alkphos expressing plasmids. 
ronc^afflTneri7ation of A A V r.ircniar intemiedifltfts nrcurs throilph uniform 
intermnlftnular remmhination Kfttwftpn TTT^s of indppfmdent viral genomes . 

5 To better understand the mechanisms of circular concatamer formation, a 

detailed structural analysis was performed of five bi-functional circular 
concatamers isolated fix}m rAAV infected muscle samples. As previously 
described for the AV.GFP3ori genome (Example 2), the conversion of 
monomeric circular AAV genomes to large multimeric circular concatamers with 

10 a predominant head-to-tail structure increased with time in muscle. To evaluate 
the structure of bi-fimctional circular concatamers, restriction enzyme mapping 
and Southern blot analysis using ^^P-labeled EGFP, Alkphos, and ITR probes 
was employed. Results from five analj^ed plasmids demonstrated between 3-6 
genomes within these circular concatamers. Two representative structures from 

15 35 and 80 day time points are shown in Figure 1 8. Several interesting 

conclusions can be made from this structural analysis. As described , head-to- 
tail oriented genomes could be seen in all isolated concatamers. However, 
several examples of head-to-head and tail-to-tail genome combinations of 
AV.Alkphos and AV.GFP3ori were also seen. Since head-to-head and tail-to- 

20 tail genome concatamers were never seen in muscles infected with AV.GFP3ori 
alone, there must be a selective disadvantage for bacterial replication when ori 
sequences are in either of these conformations. However, since the AV.Alkphos 
genomes do not contain a bacterial origin of replication, this orientation is 
permitted in chimeric concatamers. Second, noticeable deletions and/or loss of 

25 restriction sites close to ITRs were noted (Figure 1 7). It is not known whether 
deletions close to the ITR are a common event in the concatamerization process, 
but if so, this could account for the fact that only 60% of GFP/Alkphos 
hybridizing circular intemiediates also expressed the Alkphos transgene. 
Discussion 

30 Concatamerization of rAAV genomes has long been recognized in 

integrated proviral genomes. Recently, the association of this concatamerization 
process with the formation of high molecular circular genomes in muscle has 
suggested that this process may also be important in episomal persistence. The 
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findings described herein demonstrated rescue of independent viral genomes 
within the same circular concatamer, suggesting that this process of 
concatamerization occurs through intermolecular recombination. Furthennore, 
at 14 days the predominant form of viral genome in muscle was circular 
5 monomers (Example 2), which correlates with the results described above 

demonstrating only GFP expression in rescued circular intemiediates at this time 
point. Together with the fact that bi-fimctional rescued circular concatamers 
increase with time, these results suggest that large concatamers form by 
recombination of monomeric circular precursor genomes. Furthermore, since an 
1 0 alternative model of concatamerization by rolling circular replication would be 
expected to yield only GFP expressing rescued plasmids in this system, this 
mechanism does not appear responsible for concatamerization. 

Based on the structural analysis of these bi-fimctional circular 
intermediates, recombination between monomeric circular rAAV genomes is 
1 5 likely facilitated through ITR sequences. Directionality of this recombinational 
event does not appear to play a significant role, since head-to-tail, head-to-head, 
and tail-to-tail oriented intermolecular concatamers were found. In addition, the 
extent to which recombination within ITR repeat regions occurs in bacteria is 
presently unknown and may account for the deletions and/or restriction site 
20 losses near ITR arrays. However, serial passaging of bi-fimctional circular AAV 
genomes in bacteria has suggested that the structure of these large concatamers is 
impressively stable in bacteria. 

Intermolecular recombination of rAAV genomes to form single circular 
episomes may be particularly usefiil for gene therapy. For example, large 
25 regulatory elements and genes beyond the packaging capacity of rAAV may 
become linked after co-infecting tissue with two independent vectors 
(Figure 19). This strategy could also involve trans-splicmg vectors encoding two 
independent regions of a gene which are brought together to form an intact 
spUcing unit by circular concatamerization. 
30 For example, two independent vectors encoding two halves of the CFTR 

gene flanked by donor and acceptor sphce site sequences are prepared. 
Expression of fimctional CFTR protein results after splicing of RNA transcribed 
fi-om a concatamerized genome comprising both halves of the gene in the sense 
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orientation. One rAAV vector may comprise the first 3.3 kb of the CFTR gene 
under the control of the RSV promoter and an in-firame sphce donor site at the 3' 
end of the CFTR cDNA. The second rAAV vector encodes a sphce acceptor 
intronic sequence, the 3' 1.4 kb of the CFTR gene, and SV40 poly-adenylation 

5 sequences. To test for efficient spUcing, a chimeric vector (pcDNA3 . 1 CFTR- 
Donor/ Acceptor) is introduced to Xenopus oocytes by nuclear injection of the 
vector, followed by two electrode voltage (TEV) clamp recording fimctional 
analysis of CFTR (Jiang et al., 1998), mRNA transcripts are also analyzed for 
correct splicing following transfection of pcDNA3.1CFTR-Donor/Acceptor into 

10 MDCK cells. Polarized airway epithelial cells grown at the air-hquid interface 
are co-infected witii the donor and acceptor CFTR AAV vectors CFTR gene 
expression in these cells is then monitored by both immunofluorescent 
localization and fimctional analysis of short circuit currents (Smith et al., 1992; 
Smith et al., 1990). Hirt analyses of episomal AAV species are used to correlate 

1 5 the efficacy and persistence of CFTR gene expression with the formation of 
AAV circular intermediates. 
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WHAT IS CLAIMED IS: 

1 . An isolated and purified DNA molecule comprising at least one DNA 
segment, a biologically active subunit or variant thereof, of a circular 
intermediate of adeno-associated virus, which DNA segment confers increased 
episomal stability, persistence or abundance of the isolated DNA molecule in a 
host cell. 

2. The DNA molecule of claim 1 in which the DNA segment comprises at 
least a portion of a 5' inverted terminal repeat of adeno-associated virus. 

3. The DNA molecule of claim 1 in which the DNA segment comprises at 
least a portion of a 3 '-inverted terminal repeat of adeno-associated virus. 

4. The DNA molecule of claim 1 which further comprises a marker or 
selectable gene. 

5. A plasmid comprising the DNA molecule of claim 1 . 

6. A gene transfer vector, comprising: 

a) at least one first DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability or integration of the vector in a host 
cell; and 

b) a second DNA segment comprising a gene. 

7. The vector of claim 6 in which the first DNA segment comprises at least 
about 550 bp of adeno-associated virus sequence. 

8. The vector of claim 6 in which the first DNA segment comprises at least 
about 400 bp of adeno-associated virus sequence. 

9. A therapeutic gene transfer vector, comprising: 
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a) at least one first DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability or integration of the vector in a host 
cell; and 

b) a second DNA segment comprising a gene encoding a 
therapeutically effective polypeptide. 

1 0. A method of delivering a gene to a cell comprising: contacting the cell 
with the vector of claim 6 or 9. 

11. A composition for deUvering a gene to a cell, comprising: the vector of 
claim 6 or 9 and a delivery vehicle. 

12. The composition of claim 1 1 wherein the delivery vehicle is a 
pharmaceutically acceptable carrier. 

13. The composition of claim 1 1 wherein the delivery vehicle is a liposome. 

14. The DNA molecule of claim 1 which comprises concatamers of the 
circular intermediate. 

15. The DNA molecule of clahn 1 in which the stability, persistence or 
abundance of the DNA in cells is enhanced by a DNA binding protein. 

16. The DNA molecule of claim 15 wherein the DNA binding protein is 
adenovirus E2a 

17. A host cell comprising the vector of claim 6 or 9. 

18. A host cell comprising the DNA molecule of claim 1 , 

19. An animal comprising the vector of claim 6 or 9. 
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20. The animal of claim 19 which is not a human. 

21 . A method of expressing a gene product in the muscle tissue of an animal, 
which comprises: administering the vector of claim 6 or 9 to the muscle tissue of 
said animal in an amount effective to express the gene. 

22. The method of claim 2 1 wherein the vector is administered dissolved or 
suspended in a liquid pharmaceutically acceptable carrier. 

23. The method of claim 22 wherein said liquid carrier comprises an aqueous 
solution. 

24. The method of claim 21 wherein said gene comprises a DNA segment 
encoding a protein operably linked to a promoter operable in said muscle tissue. 

25. The method of claim 21 wherein said administering is by intramuscular 
injection. 

26. The method of claim 21 wherein said administering is by transdermal 
transport. 

27. The method of claim 21 wherein said animal is a bird or mammal. 

28. The method of claim 1 wherein said animal is a human. 

29. A method of expressing a gene in a eukaryotic cell, comprising: 

a) transfecting a eukaryotic host cell susceptible to adenovirus 
infection with the vector of claim 6 or 9 and a recombinant adenovirus helper 
vector so as to form packaged viral particles; and 

b) infecting a eukaryotic host cell with the viral particles in an 
amount effective to detect expression of the gene. 

30. A composition comprising: 
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a) a first adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a 5 '-inverted terminal 
repeat of adeno-associated virus; 

ii) a second DNA segment comprising at least a portion of an 
open reading fi"ame operably linked to a promoter, 
wherein the DNA segment does not comprise the entire 
open reading fi"ame; 

iii) a third DNA segment comprising a splice donor site; and 

iv) a fourth DNA segment comprising a 3 '-inverted terminal 
repeat of adeno-associated virus; and 

b) a second adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a 5 '-inverted terminal 
repeat of adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising at least a portion of an 
open reading fi^e which together with the DNA segment 
of (a)(ii) encodes a full-length polypeptide; and 

iv) a fourth DNA segment comprising a 3 '-inverted terminal 
repeat of adeno-associated virus. 

3 1 . The composition of claim 30 further comprising a delivery vehicle. 

32. A method to transfer and express a polypeptide in a host cell comprising 
contacting the host cell with the composition of claim 30. 

33. A method to transfer and express a polypeptide in a host cell comprising 
contacting the host cell with a first adeno-associated virus vector comprising 
linked: 

i) a first DNA segment comprising a 5 '-inverted terminal repeat of 
adeno-associated virus; 

ii) a second DNA segment comprising at least a portion of an open 
reading fi^e operably linked to a promoter, wherein the DNA 
segment does not comprise the entire open reading fii^e; 
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iii) a third DNA segment comprising a splice donor site; and 

iv) a fourth DNA segment comprising a 3 '-inverted terminal repeat 
of adeno-associated virus. 

34. The method of claim 33 wherein the host cell is further contacted with a 
second adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a 5 '-inverted terminal repeat of 
adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising at least a portion of an 
open reading frame which together with the DNA segment 
of (a)(ii) encodes a full-length polypeptide; and 

iv) a fourth DNA segment comprising a 3 '-inverted terminal repeat 
of adeno-associated virus. 

35. The method of claim 32 or 33 wherein the host cell is a lung epithelial 
cell, a muscle cell or a neuron. 

36. The method of claim 32 or 34 wherein the polypeptide is the CFTR 
polypeptide. 

37. A method of expressing a gene product in the muscle tissue of an animal, 
comprising contacting the muscle tissue with the composition of claim 30 in an 
amount effective to express the polypeptide. 

38. A method of expressing a gene product in the airway epithelia of an 
animal, comprising contacting the airway epithelia with the composition of claim 
30 in an amount effective to express the polypeptide. 

39. A method of expressing a gene product in the neurons of an animal, 
comprising contacting the neurons with the composition of claim 30 in an 
amount effective to express the polypeptide. 
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40. The use of the vector of claim 6 or 9 for the manufacture of a 
medicament for the treatment of a pathological condition or symptom in a 
mammal. 

41 . The use of the composition of claim 30 for the manufacture of a 
medicament for the treatment of a pathological condition or symptom in a 
mammal. 
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1 

SEQUENCE LISTING 

<110> University of Iowa Research Foundation et. al. 

<120> Adeno-associated virus vectors and uses 
thereof 

<130> 875.007WO1 

<150> US 09/276,625 
<151> 1999-03-25 

<150> US 60/086,166 
15 <151> 1998-05-20 

<160> 7 

<170> FastSEQ for Windows Version 3,0 



10 



20 



25 



<210> 1 
<211> 20 
<212> DNA 

<213> Adeno-associated virus 



<400> 1 

20 



cgggggtcgt tgggcggtca 

<210> 2 
30 <211> 19 

<212> DNA 

<213> Adeno-associated virus 
<400> 2 

19 

35 gggcggagcc tatggaaaa 

<210> 3 
<211> 505 
<212> DNA 
40 <213> Artificial Sequence 
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<220> 

<223> A consensus sequence of inverted terminal repeats 
<400> 3 

5 cgggggtcgt tgggcggtca gccaggcggg ccatttaccg taagttatgt aacgactgca 
ggcatgcaag ctcgaattca tcggtagata agtagcatgg cgggttaatc attaactaca 
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcgctgagg 
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 
gagcgcgcag ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg 
10 ggcgaccttt ggtcgcccgg cctcagcgag cgagcgagcg cgcagagagg gagtggccaa 360 
ctccatcact aggggttcct tgtagttaat gattaacccg ccatgctact tatctacagc 420 
ttgcatgcat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 
tggcgttttt ccataggctc cgccc 

15 <210> 4 

<211> 272 
<212> DNA 

<213> Adeno-associated virus circular intermediate, clone p81 



20 <400> 4 

gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcggccaa 
aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagag 



25 acttatctac cgatgaattc gagcttgcat gc 



60 
120 
180 
240 
300 



480 
505 



60 
120 
180 



agggagtggc caactccatc actaggggtt ccttgtagtt aatgattaac ccgccatgct 240 



272 



60 



<210> 5 
<211> 300 
<212> DNA 

30 <213> Adeno-associated virus circular intermediate, clone p81 

<400> 5 

gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgcgcgc 120 
35 tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga gcccgggctt tgcccgggcg 180 
gcctcagtga gcgagcgcgc gcgcagagag ggagtggcca actccatcac taggggttcc 240 
ttgtagttaa tgattaaccc gccatgctac ttatctaccg atgaattcga gcttgcatgc 

<210> 6 
40 <211> 272 



300 



BNSDOCID: <WO 99601 46A1 IA> 



wo 99/60146 



PCT/US99/11197 



<212> DNA 

<213> Adeno-associated virus circular intermediate, clone p81 
<400> 6 

5 gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa 
aggtcgcccg acgcccgggc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag 
agggagtggc caactccatc actaggggtt ccttgtagtt aatgattaac ccgccatgct 
acttatctac cgatgaattc gagcttgcat gc 

10 

<210> 7 
<211> 165 
<212> DNA 
<213> Unknown 



60 
120 
180 
240 
272 



15 



<220> 

<223> SEQ ID NOil of U.S. Patent No. 5,478,745 



<400> 7 

20 aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 
gagcgcgcag agagggagtg gccaactcca tcactagggg ttcct 



60 
120 
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